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Abstract 

We solve a model problem from single crystal plasticity. We con- 
sider 4 slip systems in the plane with orthogonal slip-directions and 
equal slip rates, forward as well as backwards. We compute the as- 
sociated dissipation distance by solving an optimal control problem. 
It turns out that from a computational point of view computing the 
distance is inexpensive. We put special emphasis on visualization of 
the metric spheres and the associated length-minimizing curves. 
As a byproduct we also solve a related problem, optimal path plan- 
ning for a car driving forwards and backwards with limited turning 
radius in the hyperbolic plane. This is a hyperbolic version of the 



Reeds-Shepp-Car-Problem first discussed in |17| 



1 Introduction 

In this paper we provide the solution to an optimization problem which has 
various interpretations. Although we will put special emphasis on the one 
mentioned in the title, the underlying mathematical problem does not require 
any knowledge from continuum mechanics and/or finite plasticity. Therefore 
we start with a naive formulation as an optimal factorization problem in the 
group SL(2,M) of invertible 2 by 2-matrices with determinant 1. 
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A factorization problem 

Let st(2) denote the set of 2 by 2-matrices with zero trace, and let 

So P and Q are generators for shearings along the coordinate axes in R^. 
Then it is well-known that every g e SL(2, R) may be written as a product 

g = exp{tiAi) ■ ■ ■ exp{tkAk) with k eN, tk eR, and Ak e {P, Q}. 

We want to find factorization(s) of a given g such that Y2i l^il minimal. 
Therefore we define the factorization cost T{g) as 

r{g) = ini \^\ti\ : g = exp(ti^i) . . . exp(tfc^fe), A; G N, 4 G R, 
[ * Aue{P,Q} 

The factorization cost T{g) can be interpreted as the distance of g from the 
identity matrix. It can also be used to measure distances in the group SL(2): 

-D(^o, ^i) = T {go^gi) , or D{go, gi) = T {gogi^) . 

It turns out that D, D are metrics on SL(2), by construction D is left- 
invariant while D is right- invariant: 

D{go: gi) = D{ggo , ggi), D{go, gi) = D{gog , gig) for all g, go, gi e SL(2). 

In the sequel we will solve the problem of computing T{g) through an asso- 
ciated optimal control problem. Our technique is kind of standard in control 
theory in the sense that wc use the Pontrjagin Maximum Principle (PMP) as 
a necessary condition for optimality plus some adhoc comparison arguments. 
We will also point out how the (PMP) relates to the yield surface and flow 
rule used in the plasticity literature. 

Sneak preview 

In order to give the reader an idea of the final outcome (and the computa- 
tional complexity) we state a few consequences of our final results. 

Theorem 1.1 Every g e SL(2) has an optimal factorization of the form 

g = exp(iiAi) . . . ex.p(t(iAe), with Ak e {P, Q,P + Q}, e R. 



2 



So no more than 6 factors are needed, but it is necessary to allow factors of 
the form exp(±t(P + Q)), too. Otherwise T{g) will be an infimum for some 
matrices g G SL(2), in particular for g = exp(t(P + Q)) with t ^ 0. 

As a consequence, finding an optimal factorization is reduced to a finite 
problem. We will actually obtain the following, much more detailed infor- 
mation: 

Theorem 1.2 There exists a sufficient family consisting of 64 maps, i.e., 
there exist functions /i, . . . , : — > SL(2) with the following property: 
For every g G SL(2) there exist k G {1,...,64} and r,s,t > such that 
fk{r,s,t) provides an optimal factorization of g. 

If one is really interested in computing an optimal factorization explicitly 
one can exploit the symmetry of the problem and reduce the number of maps 
that have to be inverted to 13 (rather than 64). For efficient computation of 
the function T one can even reduce this to 12 maps. 

To give a rough idea of the computational complexity we note that except 
for one map, nothing worse than solving quadratic equations is required. In 
this worst case the challenge consists of solving a cubic equation p{x) = y, 
and this needs to be done only over an x-interval where the underlying cubic 
polynomial p is strictly increasing, convex, and, p'{x) is bounded away from 0. 

Finally, we will show that for any other pair of rank-1 matrices 3^,3"^ G 
sl(2) the solution of the associated factorization problem for G {i^*^, ±3"^} 
can be obtained from T{g) in the following, very simple way: 

Theorem 1.3 Let 3\ 3^ G 5[(2) with det{3^) = det{3^) = 0, [3\ 3^] ^ 0. 

Let Ts denote the factorization cost for {±5*^, ±5*^}. Then there exist A > 
and an automorphism a: SL(2) — * SL(2) such that Ts{g) = XT{a{g)). 

We will also show how A > and a G Aut(SL(2)) arc obtained, given 
3^ .,3'^. Thus we have determined the dissipation distance for any 2-slip 
system with symmetric dissipation functonial. 

A reader's guide 

This paper serves several purposes, therefore a few remarks concerning these 
seem to be in order. 

The main purpose is to illustrate the application of optimal control tech- 
niques and Lie group methods to finite plasticity. So partly this paper is 
intended as a tutorial for non-specialists in optimal control on Lie groups. 
Therefore we will discuss everything in great detail and provide rigorous 
proofs. In this spirit this report is a successor of Sussmann's and Tang's 
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paper [|I^ on the Reeds-Shepp-Car-Problem. As our factorization problem 
is related to the Reeds-Shepp-Car-Problem in the hyperbolic plane, our ar- 
guments and results will bear some strong resemblance with those in |TB| . 



Therefore we would like to stress that in this paper we put special emphasis 
on how to exploit the Lie group structure of SL(2). The latter is instrumen- 
tal in reducing the complexity and streamlining the discussion. Moreover, 
it is indispensable if one's aim is to treat similar problems in SL(2) and, 
eventually, in SL(3). 

As the solution of the hyperbolic Reeds-Shepp-Car-Problem requires only 
little extra effort, we will provide it in an appendix. Although the result re- 
sembles that for the euclidean case, some aspects are different. For the geome- 
ter these are properties that distinguish hyperbolic from euclidean geometry. 
The interpretation as a path planning problem in the hyperbolic plane also 
provides a good visualization tool. It is noteworthy to mention that even if 
one does not care about hyperbolic geometry, one can benefit from it because 
some of the adhoc arguments suddenly have a simple interpretation — they 
might seem perfectly obscure and unmotivated, otherwise. 

Visualization of the metric spheres (i.e., level sets of the factorization cost 
T) is another issue we deal with. Since the group SL(2) is three-dimensional, 
everything can be visualized in M^, but how? We will use a parametrization 
coming from a polar decomposition, first proposed by Hilgert and Hofmann 
in [g. As a set, SL(2) is identified with x [-vr, vr) C M^^ and is identified 
with the simply connected Lie group with Lie algebra 51(2). An advantage of 
this parametrization is that it immediately allows to recognize the symmetry 
inherent to the problem. A disadvantage is that the group operation is more 
complicated than matrix multiplication. The purpose of the first appendix is 
to collect information about this parametrization which is scattered around in 
the literature. This information is not necessary to understand and interprete 
the pictures of the metric spheres, but it is indispensible for generating them. 

The remainder of the paper is organized as follows: 

2. Prom finite plasticity to Lie groups 

Brief outline how plasticity leads to consider metrics on Lie groups. 

3. The underlying optimal control problem 

4. Symmetries and isometrics 

5. The structure of extremals 

Discussion of the (PMP), description of yield surface and flow rule. 

6. A sufRcent family for SL(2) 

Summary and short discussion of how to find best factorizations. 
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7. Comparison arguments 

Rigorous proofs for tlie sufficiency of the famiUy described in Section 6. 

8. Conclusion 

Brief outlook on future work and how to treat similar problems. 

Appendix A: Parametrizing the simply connected group SL(2,R) 

All information necessary to generate the graphics. 

Appendix B: The hyperbolic Reeds-Shepp-Car 

Missing arguments and comparison with the results in ||18|| . 

Appendix C: More details for SL(2,]R) 

Additional information clarifying some of the arguments given in Sec- 
tion 7. 

Notation. As we will have to write products of exponentials repeatedly, we 
need a shorthand notation. Let G be a Lie group with Lie algebra q and 
exponential function exp : g ^ G. Then we define 

M (Ai, . . . , Afc) := exp(Ai) ■ ■ ■ exp(Afc), keN, Ai, . . . , A^ e g. 

Thus M (■) is a map from IJfcGN 0^ ^- "^^^ '^^P (') depends, of course, 
on the group G, so one should write Mg(Ai,...). But except for a few 
situations in the appendix it will always be clear from the context in which 
group we are working, so we omit the subscript G most of the time. 

Acknowledgement. This research was supported by DFG within SFB 404 
Multifield Problems. The author is grateful to Alexander Mielke and Klaus 
Hackl for stimulating discussions. 

2 From finite plasticity to dissipation distances 
on Lie groups 

The idea to use left-invariant metrics on Lie groups in the modelling of elasto- 
plastic material behavior is due to Mielke, cf. [0. For a detailed overview of 
this approach we refer to [§. Let us quickly outline some of the main ideas 
of this approach. 

A global formulation of elastoplasticity 

Consider a body f2 C R'' that undergoes a deformation ip : Q W^. Let 
F = D(p denote the deformation gradient. Inelastic material behavior is 
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described by an internal state z from some set Z. The whole material model 
is based on two scalar constitutive functions, the elastic potential tl) and the 
dissipation potential A. These give rise to an elastic storage energy and 
a dissipation functional. Considering the evolution (yj(t),z(t)) under the 
influence of some time-varying external forces, the total elastic and potential 
energy (or Gibb's energy) at time t is 

£it,'^,z)= i V^(x,Dcp,2)dx- (£(t),£p), 

the second term corresponding to the work by external forces. The dissipa- 
tion A is supposed to depend only on the evolution of the internal state z{t), 
i.e., A = ^{x, z, z). One defines the dissipation distance D as 

5(x,;Zo,2i) =inf i / ^x, z{s), z{s)) ds \ G ^^([0, 1], Z), 

2(0) =2o, Z{1)=Z^ 

Integrating over VL one defines 1^(205^1) = -D(x, Zq; ^1) dx. Finally the 
total dissipation along a path z{t) is defined as 

Diss(z; [ti,t2]) = sup |^ r'(z(rj), z(rj_i)) \ti = tq < ■ ■ ■ < Tn = t2 

With these functionals one obtains a notion of solution processes without 
making any differentiability assumptions. A process {Lp{t),z{t)) is called a 
solution process over [0, T] if it satisfies the following two conditions: 

(S) Stability: S{t,cp,z) < £{t,^,z) +V{z,z) for all t G [0,T] and all 
comparison states (^,5); 

(E) Energy inequality: 

S{h, ^{h), z(ti))+Diss(z; [ti, h]) < S{t2, cp{t2), zih))- j (l{s), ^(s)) ds. 

This formulation does not involve any derivatives, neither of if)^ A nor of 
Dc^, 2. As is shown in ||^ this formulation is consistent with classical elasto- 
plastic flow rules if the solution is sufficiently smooth. A particular advantage 
of this global formulation is that it allows to derive incremental time-stepping 
algorithms which are robust. 
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Multiplicative elastoplasticity: constitutive laws 

So far we outlined the general approach without making any assumptions on 
the internal state space Z. Multiplicative elastoplasticity uses the splitting 
Dc^ = F = Fei Fpi and considers Fpi as an internal variable while the elastic 
potential i\} is supposed to depend only on Fgi = F Fpi^^. Actually z = Fpi^^ 
is used as internal state, and the internal state space Z is a connected Lie 
subgroup, say G, of GL{d). Typically, G = Z = SL{d), but other groups 
may be considered, too. The following constitutive laws are postulated: 

(Syl) Objectivity: (frame indifference) iIj{x,'R.F, z) = iIj{x,F,z) for all 
R G SO(rf); 

(Sy2) Plastic indifference: 'ip{x,Fg~^ , gz) = 'tjj{x,F, z), 
A(x, gz, gz) = A{x, z, z) for all g G G; 

(Sy3) Rate independence: A(x, z, az) = aA(x, z, z) for a > 0; 

Material symmetries may be captured, for example, in a group S C 0(c/) flG. 
Following the notation in we postulate this as constitutive law, too: 

(Sy4) Material symmetry: ^(x, F,27) = ^(x, F, 2;), 
A(x, Z7, 27) = A(x, z, z) for all 7 G 5. 

Property (Sy2) implies that the dissipation distance D defined in the pre- 
vious subsection is invariant under left-multiplication with elements from G, 
hence 

5(zo,Zi) = D{l,z^'z{) =: D{z^^z^). 

These metrics are the objects we want to study. Here we dropped the material 
point X for sake of simplicity. In the sequel we will never consider dependency 
on X. This does not necessarily mean that our considerations are limited to 
homogeneous media. There are suitable formulations where all considerations 
are first limited to a fixed material point x, and the final result is obtained 
by integration over fi, cf. |^ 

Dissipation distances on Lie groups and time optimal 
control problems 

From now on we will assume that the internal state space is a connected 
Lie group G with Lie algebra g. Therefore we slightly change the notation. 
Henceforth, we write g & G (instead of z E Z). Our next goal is to discuss 
the consequences of the constitutive laws (Sy2) and (Sy3). By (Sy2) the 
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distance function D: G x G ^ [0, oo] is left-invariant. For the dissipation 
potential A this means that A{g,g) = A{g^^g) for some A: g — [0, oo]. 
Thus, given A our goal is to analyze the function D : G ^ [0, oo], 

D{go)=mfy^'A{g~\t)g{t))dt:geC\[0,l],G), g{0) = 1, g{l) = goj- 

Rate independence (Sy3) implies A{aX) = aA{X) for all a > 0, X G g. 
Therefore the definition of D still contains redundancy. We can reparametrize 
curves by their A-length. 

Indeed, assume (Sy3) and A{X) > for all X 7^ 0. Now suppose that 
g E G\[0,l],G) is given. Let L{t) = Jl^ A{g-\t)g{t)) dt and set Li = 
We only need to consider finite length, so Li < 00. Now L{t) is different iable, 
and L'{t) > in [0,1]. 

If g{t) 7^ in [0, 1], then L'{t) > 0, and L has a differentiable inverse L^^. 
An elementary computation shows that go : [0, Li] — G is parametrized 
by A-length, hence for •yit) = g{L~^{tLi)) we obtain 7(0) = (7(0), 7(1) = 
(7(1), andA(7-i7)=Li. 

In the general case {g{t) = is possible), L{t) is only monotone increas- 
ing. In that case one uses L{s) = sup {t : L{t) < s} and shows that g o L 
is differentiable (although L need not be differentiable). Hence D can be 
characterized in the following ways 

D{go) = inf i / A{g-'g) dt : g e G\[0, 1]; G), A{g-'g) = const 
V' 9(0) = 1, g{l)=go 

= mf{T:{3geG\[0,T];G))giO) = l, g{T) = g,, A{g-'g)^l] 
= inf|T:(3(7GCi([0,T];G))^7(0) = l, g{T) = g^, A{g-'g)<lY 

Now let U = {X G g : A{X) < 1}. Then the last statement says we 
must look for solutions of the differential inclusion g~^g G U, with boundary 
data g{0) = 1, g(T) = go such that T is minimal. Thus computing the 
dissipation distance D is equivalent to solving a time-optimal left-invariant 
control problem on the Lie group G. Such problems are well-studied and 
standard results are available, cf. H, |lT 

Theorem 2.1 Assume thatU is compact and convex and D{go) < 00. Then 
there exists an absolutely continuous g: [0,D{go)] G such that g^^g G U 
a.e., g{0) = 1, and g{D{go)) = go. 
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Thus length minimizing arcs always exist within the class of absolutely con- 
tinuous paths, provided there is some path with finite length and the set 
W C g is compact and convex. If W is a 0-neighborhood then it is clear that 
D{go) < oo for all go G G. But this is still true under much weaker hypothe- 
ses. For example, let ((W)) denote the smallest subalgebra of g containing 
U. Then we have: 

Theorem 2.2 Assume that U = —U and {(U)) = g. Then D(go) < oo for 
all go G G. 

In control theory language: every go E G is reachable from the group iden- 
tity 1 along a trajectory of g~^g G U, see ^ Theorem 1] or [0, Thm. 5.1], 
for example. The condition {{U)) = g is also necessary because the set 
{go G G : D{go) < oo} is nothing but the reachable set (from 1) of the sys- 
tem g~^g G U. And if {{U)) 7^ q, this reachable set is contained in a proper 
subgroup of G. 

Still D{go) < 00 for all go E G may hold true under much weaker hypothe- 
ses. In fact, let S(U) := (expM+W) denote the subsemigroup of G generated 
by exp(]R"'"W). Then D{go) < 00 for all go E S(U), and 



Actually, S(U) = G may hold under extremely weak assumptions. To give 
just one more example, consider g = s[(2) and U = conv(0, P, ~Q) = 
[0, 1] conv(P, -Q). Then S{U) = SL(2) holds true. 

Control systems on Lie groups and automorphisms 

Given a set W C g we now consider the left-invariant control system given by 
the differential inclusion g^^g E U a.e., and analyze some of its properties. 
Left-invariance means that for a trajectory g{t) and an arbitrary go E G the 
path 'git) := gogif) is a trajectory, too. 

We now write %i{go) instead of D{go) because we want to consider various 
possibilities for U. Our first observation is that 



In fact, this is obtained simply by reparametrization. Next we observe 
Proposition 2.3 IfU = —U then %({go) = %A{go^) for all go E G. 



D{go) < 00 for all go E G 




'^(S'o) = - %i{gQ) for all W C g, /i > 0, and go E G. 



(1) 
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Proof. Take g : [0,t*] G with ^(0) = 1, g{t*) = qq, and let g{t) = 
g^^g{t* - t). Then 5^(0) = 1, g{t*) = ^o"', and 

r'(t)5^W = 9{f - t)-'go9o'9{t* - t) (-1) = -g{f - t)-'g{f - t) e -U. 

Since U — —U, the claim follows. □ 

For the metric D this means that D{go, gi) is symmetric iiU = —14, for then 
D{gi,go) = D{l,g~^go) = D{l,gQ^gi) = D{go,gi). Finally we observe that 
group automorphisms interact well with the ODE g{t) — g{t)u{t). For g E G 
we denote left-multiplication with (7 by A^, : G — > G, Xg{go) — ggg. As is 
differentiable, we denote its differential by dXg. 

Proposition 2.4 Let a e Aut(G') and let a' — da{l). Then 

da{g) = o?A^(g) a' dXg-i {g) for all g eG. 

In particular, if g{t) is such that g^^g e U, then g{t) :— a{g{t)) satisfies 
g-'gea'iU). 

As an immediate consequence we obtain the following estimate: 

Proposition 2.5 Let a e Aut(G') andU,U C q such that a'U C U. Then 

^uido) <%i{(^~^{go)) for all go eG. 

Proof. Let g: [0,t*] ^ G he absolutely continuous with g~^g G U a.e., 
^(0) = 1, g{t*) = a-\go). Then g{t) := a{g{t)) satisfies 

g-' 9 e a\U) C U, m = 1, g{t*) = 90- 

Hence T^igo) < t* follows. Since we may choose g such that t* is arbitrarily 
close to %{{a~^{gQ)), our claim follows. □ 

The material symmetry axiom (Sy4) can be re-interpreted now in the 
following way: right multiplication with g E G leaves D invariant, iff the 
inner automorphism = (g^ 1— ggog~^)'. G ^ G leaves U invariant, i.e., 
Ad{g)U = U. This implies that I^, leaves Tu invariant: %i = %i °lg- In other 
words: Ig is an isometry for the distance D. 
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Single-crystal plasticity 



In single-crystal plasticity the plastic flow occurs through plastic slips induced 
by movements of dislocations. These movements are generated by shearings 
or slip systems, say, = XaVa, a = 1 . . .m, where Xa,ya G M"^, = 
11^0, II = 1, and Xa -L Va- Geometrically, Xa is the slip direction and ya is the 
unit normal of the slip plane. All plastic flow has the form g = g Yla ^aS"' 
with t'o, > 0. Formally one distinguishes between 3°" and —5"" because 
mechanically the shp strains in these directions must be distinguished, cf. |0|. 

The associated Lie algebra is g = (({5'" : a = 1, . . . , m})) C 5l{d) because 
of trace(5'°) = 0, a = 1, . . . , m. In this case the dissipation functional has 
the form 



where > are threshhold parameters. The set U = {X : A(X) < 1} is a 
convex polytope: 

{X : A(X) < 1} = conv ({0} U {k'^ : a = 1, . . . , m}) . 

Indeed, since A(S'") < Kq,, the inclusion U D conv({0} U S°'}) obviously 
holds true. Conversely, if A(X) < 1, we find 71,..., 7^ > such that 



^ = T.a^c,S" and Y.a'^c^la < 1- Hcuce X = ^^Ki'^a^S"') with A« := 
i^ala > 0, J2a < 1- Wheucc X E conv({0} U 5""}). 



Thus the factorization problem described in the introduction can be in- 
terpreted as the problem of finding dissipation minimizing paths for a 2- 
dimensional single-crystal with four slip systems 5*^ = P, S'^ = Q, = —P, 

= —Q and equal slip rates Kq, = 1. Equivalently, we can say that 
we have two slip systems 5*^, S"^ and a symmetric dissipation functional: 
A{—X) = A(X) for all X G s[(2). The factorization cost T{g) is nothing 
but the dissipation distance from the identity: T{g) = D{g). 

3 The associated optimal control problem 

Before we state the control problem we fix some more notation. The Lie 
algebra s[(2) is the set of 2 x 2-matrices of zero trace. The bracket is the 
commutator: [X, Y] = XY — YX. The following matrices form a basis of 




s[(2): 




and 
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We observe that P = i(T + f/) and Q = i(T - t/). 




Figure 1: The set lA = conv(±P, ±Q) C sl{2) and the Lorentzian double 
cone {X e s[(2) : det(X) = 0}. 

Let U = conv(±P, ±Q) C s[(2). This set is simply a square in the plane 
MT + Mf/ C s[(2). Figure |1| shows how U is situated in sl(2). The Lorentzian 
double cone consists of all matrices X G sl{2) such that det(X) = 0. It is 
the set of all possible two-dimensional slip systems (plus the origin), cf. the 
discussion at the end of the previous section. The elements in the interior of 
the double cone all have purely imaginary spectrum, so they are generators 
for compact (circle) subgroups of SL(2). The plane M.H + MT C s[(2) is the 
set of symmetric matrices in s[(2). All elements outside the double cone are 
diagonalizable (over M), in fact they are conjugate to XH for some A > 0. 

Now we consider the left-invariant control system (ODE on SL(2)): 

g{t)=9itXt), g{t) e SL(2), n(-) G L-(M;W). (LICS) 

Admissible control functions are measurable, essentially bounded functions. 
The factorizations we are looking for are in one-to-one correspondence to 
the trajectories of (LICS) generated by piecewise constant controls. Given 
ti, . . . ,tk > and Ai, . . . , Ak E U , we let tj = J2i=i U — 0, . . . , fc) and 
define the control u: [0, r^] ^ W by u{t) = J2j X[tj_i,t,)- 




Ak u(t) 



'k-1 
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Let g{t) denote the associated trajectory of (LICS) with g{0) — 1. Then 



9{t) = < 



exp(Mi), te[o,n) 

M(tiA, (t-ri)A2), te[ri,r2), 



)/l {tiAi, ^2^2, • • • , tj_iAj_i, {t - Tj_i)Aj) , t e [Tj-i, Tj), 

and 5r(rfe) = M (^i^i, . . . , tkAk). 

The system (LICS) is controllable because P, Q generate 5 [(2) as a Lie 
algebra: [P, Q] = H, sl{2) ^RH + RP + RQ. Therefore every go e SL(2) 
is reachable from the group identity. Our Optimal Control Problem (OCP) 
consists of finding a time-minimal trajectory from the group identity 1 to g^o^ 

/ dt — > min, subj. to g = gu, u eU, g{0) = 1, g{t*) = g^. (OCP) 
Jo 

Since lA is compact and convex, a standard result in control theory states 
that time-optimal arcs (and controls) always exist. Using standard methods 
we will show that the corresponding controls are piecewise constant. This 
will tell us which factorizations have a chance to be optimal and in which 
sense the original factorization problem has to be modified in order to have 
solutions. Eventually our goal is to classify the optimal arcs of (OCP). 



Related problems 

There may be several non-isomorphic (connected) Lie groups having the 
same — i.e., isomorphic — Lie algebras. For example, the Lie algebras so (3) 
(real, skew- symmetric, 3x3) and su(2) (complex, skew-hermit ian, 2x2) 
are isomorphic, but the groups S0(3) and SU(2) are not (S0(3) has trivial 
center while the center of SU(2) is {±id2}). 

The local structure of the control system (LICS) only depends on the Lie 
algebra structure. Therefore, if we consider (OCP) on any other group G 
with Lie algebra isomorphic to sl(2), the whole discussion of the (PMP) will 
apply equally to any such G. 

For example, ii B — diag(l, 1,-1) denotes a bilinear form of Lorentzian 
signature on R^, then we can consider (OCP) on the group 

S0o(2, l) = {ge mat(3, M) : g'^Bg = B, det{g) = 1, g^s > O} 

because the Lie algebras 5o(2, 1) = |X : B + BX = O} and si{2) are 
indeed isomorphic. Note that S0o(2, 1) ^ SL(2) (the centers consist of one, 
resp. two, elements). 
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Therefore Problem (OCP) is strongly related to the Hyperbolic Dubins' 
Problem (HDP) and the Hyperbolic Reeds-Shepp-Car-Problem (HRSCP). 
We wish to emphasize this connection because it allows nice geometric inter- 
pretations of several of the arguments to come. 

Explaining Dubins' Problem (DP) is simple. Given two points Xq, Xi 

and tangent directions vo,vi in the plane M^, the goal is to find a C^-curve 
7: [0,L] ^ M2 g^ph ^-^^^ 

(1) 7(0) = xo, 7(0) = vo, -f{L) = xi, 7(L) = vi, 

(2) 7(3) has curvature k,{s) almost everywhere, and \n{s)\ < 1, 

(3) 7 is parametrized by arc-length, and L is minimal. 

One may interpret this as follows: Imagine driving a car in the plane. The car 
moves forward at constant speed 1 and its turning radius is limited. At time 
t = the car is located at Xq, pointing into the fo-direction. The goal is to 
drive the car in minimal time to position Xi, pointing into the f i-direction. If 
we allow the car to move backward as well as forward, we obtain the so-called 
Reeds-Shepp-Car-Problem (RSCP). 

The original sources for these problems are and 1 17 1 . Variational prob- 



lems with cost functional depending only on the curvature can be generalized 
directly to manifolds with curvature, in particular the sphere §^ and the hy- 
perbolic plane (constant curvature 1, resp. —1). In the latter case this 
leads to control systems on the Lie groups S0(3), resp. S0o(2, 1), an obser- 
vation first made by Jurdjevic in his paper on non-euclidean elastica. 

The hyperbolic Dubins' Problem (HDP) has been investigated by Monroy 
in 0. We observed in [|I^] that (HDP) is equivalent to finding time- 
optimal paths for the control system: 

g = d\g{l)u, u e conv(P, Q) a.e., g E PSL(2, M), (HDP) 

where PSL(2,M) = SL(2,M)/{±1} ^ S0o(2,l). The controls P,Q corre- 
spond to left, resp., right turns, and the control |(P + Q) = |T corresponds 
to a geodesic arc. 

Thus a product of the form M {rP, sT, tQ, . . .) corresponds to a path in the 
hyperbolic plane consisting of circular arcs and geodesic segments. Equality 
of two such products means that two (seemingly different) paths have the 
same initial and terminal positions and tangents. 

In order to visualize such paths we use the so-called conformal disc 
model of H^. Here is identified (as a set) with the open unit disc D = 

G C : < 1}. The geodesies are either diameters of D or circular arcs 
perpendicular to the unit circle. The circular arcs are parts of so-called 
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horocycles, i.e., ordinary (euclidean) circles touching the unit circle from 
inside. 



4 Symmetries and isometries 

Apparently the set U = conv(±P, ±Q) C s[(2) has some symmetries which 
we would like to exploit. Let t denote inversion, i.e., t{g) = g'^. Then we 
already observed that T{i{g)) = T{g) for all g G SL(2) because olU — —U. 
The appropriate strategy for finding more symmetries is to look for group 
automorphisms preserving the factorization cost T. 
Consider (Th-i ct, ctc/ '■ SL(2) SL(2), 



We observe that (7//, (Tt, CTf/ e Aut(SL(2)). For example, (Jnigogi) = HgogiH — 
HgoHHgiH = (THigo)(7H{gi)- Also, ajj^a^^ idsL(2), and au = cthctt = 
aT<JH- Hence V :— {idsL(2), ctj?, ctt, c;/} Q Aut(SL(2)) is a group, actually 

r ^ Z2 X Z2. 

For any Lie group automorphism a its derivative at the identity a' :— 
da{l) is a Lie algebra automorphism. In the present situation we have 



a'ff, a'-r, a'^ : sl(2) ^ sl{2), (7^(X) - HXH, a'^iX) = TXT, a'^iX) = -UXU. 



Apparently there seems to be no difference between an, and cr^, for example. 

Nevertheless the distinction makes sense because these maps have different 
domains and ranges. Next we observe that 



Thus cr'(ZY) = W for all fT e P. These are the symmetries we have been 
looking for because the following proposition holds true for any Lie group G: 

Proposition 4.1 Assume that a E Aut(G) and let a' = da{l) G Aut(g). 
If a'ipl) = U, then T{a{g)) = r{g) for all g E G. Moreover, D{go, gi) = 
-D(cr(g(o), cr(g(i)) for all go,gi E G, i.e., a is an isometry of the metric D. 

Remcirk 4.2 Although inversion i preserves T, it is not an isometry of the 
metric D, in general. In fact, it is an easy exercise to show that for a left- 
invariant metric on G we have: 

I is an isometry <^=^ D is right-invariant <^=^ D is bi- invariant. 



anig) = HgH, urig) = TgT, au{g) = UgU^ = -UgU. 



a'jjiQ) = -Q 



a'^iP) = Q, 
a'^iQ) = P, 



a'M = -P 
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The left-invariant metric defined by lA will be bi-invariant, for example, if G 
is abelian or if Ad{G)U = U. But one cannot expect bi-invariance otherwise. 



A general proof is easily obtained using Proposition p.5| . In the special 
problem that we consider here, an elementary computation allows to ver- 
ify that every a G F maps trajectories of (LICS) onto trajectories. Assume 
7()f:) = 7(t)-u(t) and let, for example, rjit) = anijit)) = Hj{t)H. Then 

= H'y{t)H = H-f{t)u{t)H = H^{t)H Hu{t)H = 7]{t) 

Hence T^anig)) — '^(fi') 9 follows. As the same is true for a]j^, 

T{aH{g)) = T{g) follows. 

In terms of the basis {iJ, T, U} of s[(2) the maps o"^, cr^, a'u are nothing 
but 180 degree rotations around the T-, and f/-axis. One can actually 
show that V = {a e Aut(SL(2)) : a\U) = U}. Finally, let F C Diffeo(SL(2)) 
denote the group generated by F U then 

F = {idsL(2), Oh, (Tt, ctu, icth, ior, i^u] ■ 

In particular, F is abelian (isomorphic to Z2 x Z2 x Z2), and T{(f){g)) = T{g) 
for all g E SL(2),0 G F. ^ 

We will use the group F in various ways. First it will allow us to streamline 
the discussion of the (PMP) and the comparison arguments. Practically 
it will allow us to reduce the number of factorization maps that have to 
be inverted from 64 down to 13: instead of solving 64 (systems of three) 
equations for the same right hand side, say g, one may solve 13 systems for 

various righthand sides from |0(5') : G f|. Although this does not really 

affect the overall computational cost, it is a great help for programming, 
testing, and debugging. 

It is also worthwhile to mention that the fixed point sets Fix{(f)) := 
{g : (f){g) = g},{(f) G F) provide information about the cut-locus. In many 
cases geodesies lose their global optimality once they hit some Fix(0). 

5 The structure of extremals 

We already observed that for U = conv(±P, ±Q) C s[(2) the left-invariant 
control system 

'y{t) = 'y{t)u{t), u{t) eU a.e., 

is controllable, and that every g is reachable from the group identity in 
minimal time. 
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Next we want to apply the Pontrjagin Maxmum Principle (PMP) to ob- 
tain necessary conditions for optimality. For invariant systems on Lie groups 
the (PMP) takes a particularly simple form, cf. |n|]. A proper statement 
requires some extra terminology: the Lie algebra dual and the adjoint and 
coadjoint action. 



Adjoint and coadjoint action 

Let G be an arbitrary Lie group with Lie algebra g. Let g* denote the vector 
space dual of g. Then there is a natural action of G on g and, by duality, also 
on g*. These are the so-called adjoint and coadjoint action. Both actions 
come from conjugation on the group. For go E G let I^,, = (g ^ %9%^) '■ 
G ^ G. Then I^^ G Aut((j). The adjoint action is obtained by differentiating 
: G — G at the group identity 1, Ad((?o) = '^Igo(-'-)- "^^^ coadjoint acton 
is obtained via duality. Instead of a coordinate-free discussion we now give 
explicit representations (in coordinates) for G = SL(2) because these will be 
needed in the subsequent discussion of the structure of extremals. 

So consider G = SL(2) and g = sl(2). Fix the basis {H, T, U} of g. So we 
can write X e 51(2) as X = hH + tT + uU with {h, t, u) e M^. Next we fix 
a dual basis in g*, so we may write p G g* as a row vector p = {ph,Pt,Pu), 
and {p, hH + tT + uU) = pnh + prt + puu. 

The adjoint action is conjugation Ad(5')X = gXg~^ for g G SL(2), X G 
s[(2). In terms of the basis {H,T,U} Ad{g) is a 3 x 3-matrix (actually 
Ad{g) G S0o(2, 1)). Although we will not need the exphcit expression, we 
state it just for sake of completeness: 

_ f CL b \ ^ / ad + bc —ac + bd —ac — bd 

^-[c d ) \^M{g) = [-ab + cd 2!^^!^ a^+^^-c--,- | ^3) 
det(^) = l J \-ah-cd ^l^^t^ a2^^^^ 



For X G s[(2) let ad(X) : sl(2) ^ s[(2), ad(X)y = \X,Y\. If X = hH + 
tT + uU, then in terms of the basis {H, T, U} we obtain 

/ 2u -2t^ 
ad(X) = -2m 2h 
\-2t 2h 

We note that ad: s[(2) —>■ so(2, 1) is a Lie algebra isomorphism. For P 
i(T + f/) and Q = 1(T - f/) we obtain: 



ad(P) = -10 , ad(Q) =1 . (3) 
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Since ad(P)^ = a.d{Q)^ = 0, we obtain 



3rad(P) 



T —r 

o2 2 ^ 

1 + ^. 

O J- r, / 



,Tad(Q) 



1 

r 

— r 



— r —T 

o 2 2 ^ 

O J- r, 1 



(4) 



Using the duality between st(2) and s[(2)* one obtains Ad(5')*p = p o Ad((y'), 
ad(X)*p = p o ad(X), and with our choice of coordinates the latter is noth- 
ing but left-multiplication of the row vector p with the matrix Ad((y'), resp. 
ad(X). The coadjoint action is defined as 

CoAd((7)p = Ad(r')>, pGs*. 

We are more in favor of this notation (rather than the frequently used Ad*(5') 
because it prevents confusion between Ad((y')* and Ad* ((7) = Ad((7^^)*. 

The Killing form is defined as Y) = trace(ad(X) ad(F)). For sl{2) 
it is a nondegenerate symmetric bilinear form of signature (+,+,—). The 
adjoint action leaves k, invariant, in particular the quadratic form ^^(X) := 
k{X,X) = trace(ad(X)^) is Ad-invariant, qK{Ad{g)X) = q^iX) for all X e 
sl(2), g G SL(2). With our choice of coordinates qf,{hH + tT + uU) = 

By duality, the quadratic form C{p) = pj^ +Pt~Pu is Ad*-invariant, i.e., 
C(Ad(^)*p) = C{p) for all p E sl{2)*, g e SL(2). As we will see soon, C{p) 
will appear as a first integral for any optimal control problem on any Lie 
group with Lie algebra isomorphic to s[(2). 

For p e s[{2)* let Op = {Ad{g)*p : g G SL(2)} denote its orbit under the 
coadjoint action. Roughly speaking, the coadjoint orbits are level sets of C, 
more precisely we have four different types of orbits: 

Hypl: If C{p) > the orbit Op is a one-sheeted hyperboloid. 

Hyp2: If C{p) < the orbit Op is the upper (p^/ > 0) or lower {pu < 0) part 
of a two-sheeted hyperboloid. 

Cone: If C{p) = 0, then either p = and Op = {0} is singleton, or p 7^ 
and Op is the upper {pu > 0) or lower {pu < 0) part of the (boundary 
of the) Lorentzian double cone, cf. Figure |l|. 

For p G sl(2)* we denote Gp = {g e SL(2) : Ad(5')*p = p} the stabilizer 
group of p, and Qp = {X G si{2) : ad(X)*p = 0} the stabilizer algebra 
of p. One quickly verifies that for p = {ph,Pt,Pu) 7^ one has Qp = 
R{PhH + ptT-puU). 
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The Pont rj agin Maximum Principle on Lie groups 
(for time-optimal problems) 

Theorem 5.1 Let G be a Lie group with Lie algebra Q, U g, and consider 
the (Lies) ■j(t) = ■j(t)u{t), u{t) e U. Assume that g{t) is a trajectory which 
is time- optimal on some interval I. Let u{t) = g{t)-'^g{t) G L°°{I;U). Then 
there exists an absolutely continuous covector function p : I ^ q* such that 

(0) p{t) 7^ for some {hence all) t & I, 

(1) Ad{g{t)~^)*p{t) is constant, 

(2) {p{t),u{t)) ^rmny^u{p{t),v), a.e. in I, 

(3) {p(t),u(t)) is constant {a.e.) , either —1, orO. 

A triple {g{t),p{t),u{t)) consisting of a trajectory g{t), a control u{t) 
generating g{t) and a covector p{t) satisfying conditions (0)-(3) is called an 
extremal of the optimal control problem. In the present situation u = g~^g, 
so specifying u is actually redundant. 

Let Ti.{p) = mm^(zu {p{t),v) denote the optimal Hamiltonian. By (3) 
Ti is an integral of motion. An extremal for which Ti.{p) = is called an 
abnormal extremal. The other extremals are called normal or regular 
extremals. Geometrically an extremal being abnormal means it satisfies the 
first order necessary conditions for any cost functional, not only the one 
under consideration. 

If W = —U optimal abnormal extremals will appear only in exceptional, 
degenerate cases. They will never appear, for example, if W is a zero- 
neighborhood, for in that case 'H{p) = =^ p = 0, and the latter is 
impossible by (0). 

The level set {Ti = — 1} is the yield surface found in the plasiticity 
literature, and the flow rule is encoded in (1) and (2). Differentiating (1) 
yields p{t) = p{t) o ad(-u(t)), while the minimizing condition (2) implies that 
u{t) lies in the subgradient of Ti. As usual the derivative-free formulation 
has the advantage that we need not make smoothness assumptions. 

In view of our discussion of symmetries and isometries we observe: 

Proposition 5.2 If a G Aut(G) satisfies cr'iU) = U, then a maps extremals 
onto extremals. Similarly, ifU — —U and {g,p,u) is an exiremal { in [0,t*]), 
then {g{t* — t), —u{t* — t), —p{t* — t)) is an extremal, too. 

The proof is an easy exercise. If {g,p,u) is an extremal and a as above, the 
image extremal is {a{g{t)), {{a')~^)*p{t), a'{u{t))). 
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Yield surface and flow rule for the model problem 

Using the basis {H, T, [/} of g = s[(2) and fixing a dual basis (of {H, T, U}) 
in g*, we write p = {ph,Pt,Pu)- With U = conv{±P, ±Q) the optimal 
Hamiltonian is 

np) = -l{\PT\ + \pu\). 

We already observed that there is another integral of motion: 

C{p) =Ph+Pt-Pu 
because C is constant along coadjoint orbits. 

Proposition 5.3 Abnormal extremals are not optimal. 

Proof. Let {g{t),p{t),u{t)) be an abnormal extremal. Then Ti.{p) = 
implies p{t) = {pnit), 0,0). Hence C{p) = PuitYi whence ph is constant. 
As p ^ 0, Ph 7^ must hold. As p is constant, p is differentiable. Let 
u{t) = ui(t)P + U2(t)Q. As = p = pog,d{u), u{t) G lA ng^ must hold. Since 
Qp = RH, u{t) = follows. Or, elementary, 

(0 Ui - U2 -Ml - U2\ 

-U1+U2 
-M1-M2 / 

so U1—U2 = and U1+U2 = 0, hence Ui = U2 = 0. Thus g = 0, i.e., g(t) = gQ 
is a constant path, hence not optimal. □ 

For regular extremals the covector p{t) evolves on the level set (yield sur- 
face) {7i = —1}. The latter is a cylinder over a square, cf. Figure ^ which also 
shows some flow lines. On each open face the minimizing control is uniquely 
determined. Switches (may) occur when p{t) hits one of the four edges, i.e., 
when pu{t) = or pT{t) = 0. At such a point the minimizing condition 
{p(t),u(t)) = min^g^Y {p{t),'f) does not suffice to characterize the control u(t) 
uniquely. But we also have the geometric information p{t) G {Ti. = —1}, and 
p{t) = p{t) o a.d{u{t)). Carefully exploiting all this information, one obtains 
that optimal controls are piecewise constant and that there are 4 types of 
extremals: 

(ALT) The optimal control u{t) follows an alternating switching pattern. 

Ph (-g) hPh (-Q)..., resp. Qh(-P)hQh(-P)..., 

and the time s between successive switches is a constant, < s < 2y/2. 
The corresponding path g(t) is a subarc of 

M{sP,-sQ,sP,-sQ,. . .) resp., M{sQ, -sP, sQ, -sP, . . .) . 
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U = P 



-P_T 



. P_H 



Figure 2: The yield surface {TC = — 1}. 

(CSP) The circular switching pattern (CSP), here the control u{t) switches 
in either clockwise or counterclockwise order from vertex to vertex: 

Phgh-Ph-ghP..., resp. Ph-gh-PhghP..., 

the time s between successive switches is constant, s G (0, v^). The 
corresponding path g{t) is a subarc of 

M(sP,sg,-sP, -sg,...) resp., M (sP, -sg, -sP, sg, . . .) • 

(SSP) The singular switching pattern(s) (SSP). In this case singular arcs 
may occur, the corresponding control is noi bang-bang (i.e., in {±P, ±g}). 
The singular controls are constant, ±|T, they may be applied on an 
interval of arbitrary length. The switching time for an intermediate 
bang arc is exactly a/2. Describing all possible switching sequences in 
general is complicated. All possibilities can be obtained as paths in the 
directed graph shown in Figure ^. If the S'-arc between 2 P-arcs has 
zero length, the value of the control function u{t) need not change. We 
call these virtual switches. Example: 

m[V2P,siT,V2P,-V2Q,-V2Q,V2P,S2T,V2P,..^ , si,S2 > 0. 

(U/2) The constant controls u± = ±|(P — Q) = ^^U. The corresponding 
path is g(t) = exp(±t • ^U), t >0. These are singular controls, too. 
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"0: 




Figure 3: The directed graph describing the singular switching patterns. 



From this characterization we can aheady deduce that every g G SL(2) 
has an optimal factorization with factors from the set exp(MP) Uexp(M(5) U 
exp(MT) Uexp(Mt/). 



How to find the switching patterns and times 

In this subsection we prove that indeed, our list consists of extremals only, 
and we explain how this list is obtained. We also indicate why there are no 
other extremals, but a rigorous proof has to be carried out in a different way. 
Therefore it is provided separately in the subseqent subsection. 

The yield surface has four faces, on each of the open faces we have PtPu 7^ 
0, and the minimizing condition (2) determines the control u uniquely (the 
subgradient dTi. is singleton). 

Due to the symmetries from F = {id, an, ctt, ctu} it suffices to consider 
only one of these faces, say, 

F = {p : pt + Pjj = 2, pT,pu > , and F 

because any other face is mapped onto F by (cr')~* for some cr € F. The 
maps (cr')^* (cr G F) are 180-degree rotations around the coordinate axes in 
s[{2)*. 

So let us consider p G F. Then dT-C{p) = {—P}. It is crucial to analyze 
what happens when p{t) hits the two boundary lines (0,2,0) + ]R(1,0,0), 
resp. (0,0,2) +M(1,0, 0). Therefore our next step is to consider the flow 
(p, r) I — >pe~''^^^^ for p G F, T G M. We compute 



-Tad(P) 



1 


— r 












2 



r 

2 



T G 



1 + 
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So {ph, 0, 2)6-'^^'^^^^ = {ph + 2r, -t{ph + r), +ph r + 2). Hence we have 




(0,2,0) 

Figure 4: The flow on the face F = {pu + pr = '^^Pu > 0,Pt > 0}. 
the following cases: 

Ph < -2V2: pe-^^'^(-P) G F for r G (0,r*) where t* = Ip^I~Vp1^ ^ ^nd 

Pfj = -2V2: pe-^^'^(^) G F for r G (0,2^2) \ {^/2}, and p^e'^^'^^^) = 
(0, 2, 0), p, e-2v^ = (2v^, 0, 2). 

Pf/ G (-2^2,0): pe-^^'^(^) G F for r G (0,r*) where r* = > 0, and 

pg-r*ad(P) = (|p^|,0,2). 

Pjj G (0,2^2): pe-^'^'^^-f') G F for r G (-r*,0) with r* = p//, cf. the 
previous case. 

Ph = 2V2: pe-^^'^^P^ G F for r G (-2v^,0) \ {-^2}, cf. the second case. 

Ph > 2V2: pe-^^'^^P'^ G Fforr G (-r*, 0) where r* = EHZ^MH ^ p e^' ^'^(p) = 
(Vp1F8,2,0). 

Applying symmetries we obtain the flow lines on the other open faces, too. 
Figure || shows the yield surface "unfolded" and the various possibilities for 
p{t). Recalling that C{p) = p\j + Pr — Pjj is constant along extremals we can 
state the following 

Proposition 5.4 Let {g,p,u) be an extremal such thatp{to) G ¥ for some tQ. 
Let C = C{p) and 

a = inf {t < to : (Vr G (t, to)) p(r) G F} 
(3 = sup {t > to : (Vr G (to, t)) p{r) G F} 

Then one of the following three cases occurs: 
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Figure 5: The Hamiltonian flow on {Ti = — 1}. 
CSP: C>4, puito) ^0, f3-a = Vc+i-VcTl ^ ^2), and 

PH{to)<0 p(a) = (-7^ + 4,0,2), piP) = i-VC~^,2,0). 

SSP: C = 4, pnito) ^ 0, P - a = V2, and 

PH{to)>0 ^ p(a) = (0,2,0), p(/3) = (2v^,0,2), 

PHito)<0 ^ p(a) = (-2v^,0,2), p(/5) = (0,2,0). 

ALT: C G (-4 ,4), (3- a = VCTA G (0, 2V2), andp{a) = {-VCTA, 0, 2), 
K/3) = (7^ + 4,0,2). 

It should be obvious why we used the labels (ALT), (SSP), (CSP). It is also 
obvious that the following triples {g,p,u) are extremals (t G M): 

(^exp (^±t ^T^ , (0, t2, 0), ±^T^ , (^exp (^±t ^U^ , (0, 0, ^2), ±^U^ . 

We refer to these as the singular arcs because the control u{t) is not an 
extreme point of U. The first one has C{p) = 4 while the second one has 
C{p) = —4. The geometric reason for the existence of these singular arcs is 
that for p G {(0, 0, ±2), (0, ±2, 0)} the yield surface and the coadjoint orbit 
Op have first order contact: &H{p) fl M.dC{p) ^ 0. 
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P ad(Q) 
p ad(-P) 



p ad(-P) 
P ad(Q) 



p ad(-P) 



p <0 
H 



U 



p >0 



P ad(-Q) 



p ad(-P)^[| 



Figure 6: The reason why switches must occur on the edges pu = 2 {pt = 2) 



We observe that an extremal with C{p) = 4 inevitably hits one of the 
points (0, ±2, 0). There we may "glue" it together with a singular arc. Hence 
we obtain that our so-called list of extremals really consists of extremals. 

In order to prove that our list of extremals is complete we must prove 
that switches must occur whenever p{t) hits one of the edges at some p ^ 
(0,±2,0). We first give a geometric explanation that would actually prove 
our claim if we knew that is piecewise different iable. Unfortunately the 
(PMP) only provides a p{t) which is a.e. differentiable. Therefore a rigorous 
proof has to be provided. This will be done in the next subsection. 

Let us consider p = {pu, 0, 2) first. The minimizing condition (2) implies 
u G conv(— P, Q). Next we compute 



p ad(-P) 
p ad(Q) 



^0 -1 V 
{pH, 0,2) I 1 I = {2,-ph,Ph) 
1 0, 



-1 -r 

(pH, 0,2) I 1 
-10 



-2, -pH, -Ph] 



Using the fact that p{t) evolves on the yield surface, p ad{u) must be subtan- 
gent to the yield surface. A glance at Figure |^ convinces us that for pn ^ 
this determines u uniquely. The same thing happens on the other edge for 
p = {ph, 2, 0) with Ph ^ 0. 

Proposition ETBI gives a flavor of how to attack a rigorous proof. The 
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remainder of this section is devoted to providing one, it is not mandatory for 
understanding the rest of this paper. 

Rigorous proofs 

In the previous discussion we already observed that the cases to distinguish 

are C{p) = -4, C{p) G (-4,4), C{p) = 4, and C{p) > 4. They correspond 
(in this order) to (U/2)-, (ALT)-, (SSP)-, and (CSP)-extremals, respectively. 
W.l.o.g. we only consider extremals {g,p,u) starting at g{0) = 1. 

Proposition 5.5 Let {g,p,u) be an extremal such that pu = on an open 
interval I. Then C{p) — 4, p{t) is constant in I , p = (0, ±2, 0), and u{t) = 
m I. 

Proof. Suppose p^/ = G /, then 'H(p) = — 1 implies \pt\ = 2 in /. 
Hence C{p) = p^ + Pt ~ Pu = P^h + 4, whence p{t) is constant in /. Thus 
= p = p a.d{u), so M G QpH— sign(pT) conv(P, Q). Since Qp = M.{phH+ptT) 
this intersection is nonempty iff pn — 0. Thus p = (0, ±2, 0) and u = T 
in 7. So C(p) = + 4 — in 7 and hence for all t as it is a first integral. □ 

Proposition 5.6 Let {g,p,u) be an eoctremal such that pt — Q on an open 
interval I. Then C{p) = —4, p{t) is constant for all t, either (0,0, ±2), u{t) 
is constant (either ), and g{t) — exp(^i[/). 

Proof. Suppose = in some interval 7. Then \pu\ = 2 in 7, 
and as in the previous proof we deduce that Ph is constant in 7, hence 
pit) = {ph,0,Pu) is constant in 7. Thus = p = p ad{u) yields u(t) G 
Qp n — sign(pj/) conv(P, —Q). As Qp = M.{phH — puU), the intersection is 
nonempty iff pn — 0. Hence p{t) = (0, 0, ±2) and u{t) = T^U in 7. Also 
C{p) = 02 + 02 - 4 = -4. 

Since \pij\ < 2 and pJi + Pt ^ Pu ~ ~^ hold for all t (not just in 7), we 
deduce p^ + Pr = ^ and \pu\ = 2 for all t. So p = (0, 0, ±2) and u = 
follows for all t. Hence g{t) = exp(=pi?7) follows, too. □ 

The switching surface for our problem is E := {p : p^/ = 0} U {p : = 0}. 

Up to symmetry (F) we have to consider only two cases: pn < 0,pt — 0,Pu — 
2 and pn < 0,Pt = '2,Pu = 0. 

Proposition 5.7 Let (g.p.u) be an extremal on some interval I and to & I 
such that p(to) = {p%, 0, 2) with p^ < 0. Then there exists an e > such that 
Prit) < in {to — s, to) and prit) > in {to, to + e). In particular, u{t) = Q 
a.e. in {to — s, to) and u{t) = —P a.e. in {to, to + s). Hence a switch occurs. 
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Proof. Since pu{to) — 2, take e such that pu{t) > for i e 7g := 
I n{to- e, to + e). Then u{t) e conv(-P, Q), so u{t) = -1 f/ + A(t) • |r 
for some measurable function A : — * [—1,1]. Since p{t) is absolutely 
continuous, we have p = p eid{u) a.e. in I^. Hence 

p^pSid(~U + X^T]^{pH,PT,Pu)\ 1 *). 



Thus Pt(^) = ~PH{t) a.e. in 7^. As pn is continuous, is differentiable in 

/f, and since Pt(^o) = 0, Prito) = ^Pnito) > wc find e such that prit) < 
in (to — £, to) and prit) > in {to, to + s). Hence u{t) — Q a.e. in {to — s, to), 
u{t) = —P a.e. in {to, to + s). □ 

The other case is treated similarly. 

Proposition 5.8 Let {g,p,u) be an extremal on some interval I and to & I 
such that p{to) = {p%, 2, 0) with p'j^ < 0. Then there exists an e > such that 
Pu{t) > m {to — e,to) andpT{t) < m {to,to-\-e). In particular, u{t) — —P 
a.e. in {to — e,to) and u{t) — —Q a.e. in {to, to + e). Hence a switch occurs. 

Proof. Since Pt(^o) = 2, we find e > such that pT{t) > for t G 1^ :— 
in {to- e, to + e). Then u{t) E conv(-P, -Q), so u{t) = -i T + X{t) ■ \U 
for some measurable function A : I^ — > [—1,1]. Since p{t) is absolutely 
continuous, we have p — p, ad(w) a.e. in 7g. Hence 

P^pad[-]-T + \]-U]={pH,PT.Pu)\-^ I = 




Thus pu{t) = Ph a.e., so pu is differentiable in I^ because pn is continuous. 
Since pu{to) — and pu{to) — PH{to) < 0, we find ei > such that pu{t) > 
in {to — ei,to) and pu{t) < in {to, to + ei). Hence u{t) — —P a.e. in 
{to — ei, to) and u{t) — —Q a.e. in {to, to + Ci). □ 

This finishes the proof that our fist of extremals is complete. It is note- 
worthy to mention that the whole discussion involves only the Lie algebra, 
its dual, and the adjoint, resp. coadjoint action. In particular our results are 
valid for any group with Lie algebra isomorphic to sl{2). 
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6 A sufficient family for SL(2) 



Although the (PMP) provides very detailed information, it is only a first or- 
der necessary condition. We need two types of additional arguments: global 
arguments (no local condition could replace them) and higher order varia- 
tions. Our goal is to provide a sufficient family of paths, i.e., a finite set of 
maps /fc : — ^ SL(2) such that for every g e SL(2) some map fk provides 
an optimal path 1 ^ g. 

Our classification of extremals provides us with candidates. Looking at 
alternating extremals we come up with 

A3(r,s,t) = M{rP,-sQ,tP) , <r,t <s <2V2, 

A4(r,s,t) = MlrP,-sQ,sP,-tQ), 

A5{r,s,t) = M{rP,-sQ,sP,-sQ,tP), (5) 

and A6, A7, . . . are defined similarly. Due to symmetry it suffices to consider 
only (ALT)-extremals starting with an exp(rP)-arc because the group T acts 
transitively on {±P, ±Q}- 

Similarly, when considering (CSP)-extremals, there are up to 8 possibil- 
ities (4 for the first vertex, 2 for the orientation: clockwise/ anticlockwise). 
The group r has 8 elements, and it turns out that for an odd number of 
factors it STiffices to consider 1 case, whereas for an even number of fac- 
tors there arc 2 distinguished cases. We may always assume that the first 
factor is exp(rP), but we must distinguish between clockwise and anticlock- 
wise switching patterns if the total number of factors is even. We obtain a 
sequence of maps starting with: 

C3{r,s,t) = M{rP,sQ,-tP) , <r,t <s < V2, 

C4a(r,s,t) = M{rP, -sQ, -sP,tQ) , 

CAc(r,s,t) = MlrP,sQ,-sP,-tQ). (6) 

For (SSP)-cxtremals we can always assume that (up to symmetry) the first 
singular {S-) arc is exp(| T), and that the preceding bang {B-) arc is exp(rP). 
For example, for 3 or 4 factors with one singular arc we obtain the maps: 

S3P{r, s,t) = M (rP, f T, tP) , s > 0, r, t e [0, ^2] , 

S3Q{r,s,t) = M(rP,fT,tg), 

S4:P{r, s,t) = M (rP, ^T,V2P, -tQ) , ^ ' 

SAQ{r, s,t) = M (rP, ^T,V2Q, -tP) . 

Enumerating all possible patterns for a large number, say n, of factors is 
an unpractical task since the number of possibilities grows exponentially in 
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n. After a singular {S-) arc we always have two choices for the next bang 
{B-) arc. But fortunately enough, it turns out that it suffices to consider 
(SSP)-extremals with one S-aic only. The proof is basically a verification of 
the identity: 

M(sr, \/2P,-\/2q) =M(y2P,-V2Q,-sT) . (*) 

Let us abbreviate w := a/2. Then (*) implies 

M (rP, SiT, wP, -wQ, -s^T) = M (rP, (si + S2)T, wP, -wQ) . 

Occurence of this identity is not a miracle but it is kind of locally detected 
by the (PMP): e^'^^e-^'^^T = -T = 2. So the singular switching 

time is in a certain sense geometrically distinguished. The hyperbolic Reeds- 
Shepp-Car problem allows to visualize this neatly, (cf. Fig. |^, p. |5^). This is 
completely analogous to the euclidean case, cf. |]18|, Fig. 18, p. 59]. 
Eventually it turns out that one only needs three more maps: 

S5P{r,s,t) = M{-rQ, wP,IT,wP,-tQ) , 

S5Q{r,s,t) = M(~rQ,wP,IT,wQ,-tP), (8) 

S7b{r,s,t) = Ml-rQ,wP,^T,wP,-wQ,-wQ,tP) 

In addition, it is convenient to consider also the following maps which are 
derived from the previous ones: 

S5a{r,s,t) = M (rP, ^T,wP, -wQ, -tQ) =SAP{r, s,t + w) 

S6{r, s,t) = M l-rQ, wP, f T, wP, -wQ, -tQ) =S5Plr, s,t + w) 

S7a{r,s,t) = mlrP, ^T,wP, -wQ, -wQ,wP,tP) 

= auiS5P{r + w,s,t + w)). (9) 

Table |1| specifies a family J-' consisting entirely of extremals. We obtain a list 
of = 76 maps, but since is F-invariant, it suffices to specify one map 
from each F-orbit. There are 16 such orbits. We list a representative, a sub- 
group generating the orbit and, if nontrivial, "the" stabilizer in columns 2-4. 
Now we can state the main result for SL(2). 

Theorem 6.1 Let JF denote the family of maps specified in Table |1]. Then 
J-" is a sufficient family for (OCP) in SL(2), i.e., for every g G SL(2) there 
exist f E and (r, s, t) G dom(/) such that f{r, s, t) = g, and the associated 
path from 1 to g has minimal length T{g) . 

The proof will be given in the next section. Observing S7b{r, s, t) = 
S5P{r, s, 2\/2) exp(tP), we immediately obtain 
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Factorization maps with relevant domain and cost 


Type 


Map 


Symmetry 


Domain 


Cost 


ALT 


A3 


r 


4 




< r,t < s < 2V2 


r + s + te[0, 6^/2] 






r 


4 




and s > 1 


r + 2s + te [2, 8^2] 






r 


4 


laH 


and s e [V2, VS] 


r + 3s + te [3^2, 5^3] 


CSP 


C3 


r 


8 




0<r,t<s<V2 


r + s + t e [0, 3^2] 




C4a 


r 


4 




and s < 1 


r + 2s + t e[0,4] 




C4c 


r 


4 


tCTT 




r + 2s + te[0,4 


SSP 


53P 


r 


4 


1(JH 


s>0,r,te [0, V2] 


r+s+t>0 




53g 


r 


4 












f 


8 






r + s + t + V2>V2 






r 


8 










S5P 


r 


4 


tan 




r + s + t + 2V2 > 2V2 




S5Q 


r 


4 


lau 








S5a 


r 


4 


laT 








S6 


r 


8 






r + s + t + 3V2>3V2 




S7b 


r 


4 


tax 






T 


16 




76 




Use bigger domains and drop some maps 


drop 


S5a 


r 


4 


SAP 


: [0, v^] X R+ X [0, 2V2] SL(2) 


drop 


S6 


r 


8 


S5P 


: [0, 2^2] X R+ X [0, 2^2] ^ SL(2) 




13 




64 




drop 


A3 


r 


4 








drop 


C3 


f 


8 








add 


B3 




2 


M{rP,sQ,tP), |r|,|s|,|t 




T2 


12 




54 





Table 1: A sufficient family of extremals for SL(2) and smaller families that 
suffice for computing optimal factorizations, resp., T{g) 
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Corollary 6.2 Every g G SL(2) has an optimal factorization with at most 6 
factors from |±P, ±Q,±|T}. There always exists a geodesic from 1 to g 
with at most 5 switches. 

As is indicated in Table |l|, there exist smaller families (fewer maps but with 
bigger domains) which are sufficient, too. For example, we find a family J-'i 
consisting of 64 maps, resp., 13 F-orbits, and another family JF2 consisting 
of 54 maps, resp., 12 orbits. 

Corollary 6.3 The families J-'i, T2 specified in Table ^ are sufficient for 
(OOP), too. 

The proof is an immediate consequence of the preceding theorem. The 
family T is the appropriate one for visualizing the metric spheres. That's 
the reason why we wanted to keep the domains as small as possible. The 
family is appropriate for computing T{g). In that case we want to keep 
the number of maps that have to be inverted as small as possible. Finally, 
J-'i is appropriate if we want to find optimal factorizations because in that 
case we have to keep track of the type of factorization (ALT or CSP). Thus 
it makes sense not to merge ^43, C3 into B3. 

Since computation of T{g) as well as finding optimal factorizations re- 
quires inverting the maps from resp., J-'i, it is important to realize that 
this is an easy task which can be carried out efficiently at low computational 
costs. Therefore we observe: 



Remark 6.4 We have exp(tP) = id+tP, and exp{tQ) = id +tQ. In partic- 
ular f{r,s, t) is linear in r and t for every f E J-'. The dependence on s is as 
follows: 



A3,C3,B3 


AA, CAa, CAc 


A5 


SSP 


linear 
in s 


quadratic 
in s 


cubic 
in s 


linear in ^, ^"^ where ^ = > 0. 
^ quadratic equation in ^. 



Hence inverting the maps in is nothing but a (time consuming) exercise in 
college algebra, except for A5 where we have p{s) = ejy45(r, s, t)ei = — 2s. 
It suffices to invert one map from each F-orbit because a{f{r,s,t)) = g iS 
f{r,s,t) = a~^{g). So instead of inverting, say, all 54 maps in JF2, it suffices 
to invert 12 representatives and apply them to several right hand sides, ac- 
cording to Table An efficient implementation can also use the information 
on the domains to "discard" solutions of f{r,s,t) = g as, soon as one of the 
parameters r, s,t is not in the appropriate range, i.e., dom(/). A smart im- 
plementation also uses the fact that some extremals inevitably generate large 
cost. For example, an optimal 745-extremal has at least cost 3a/2. Hence it 
is not necessary to check 745-type factorizations if some other factorization 
has already given cost less than 3 a/2. 
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7 Comparison arguments 



In this section we provide all the arguments necessary to prove Theorem pTl . 
The proof is based on comparison arguments. We will show that certain 
factorizations are not optimal. Hence they can never appear as a subarc of 
an optimal arc. 

We must treat the four classes of extremals separately. The singular 
(U/2)-extremals and the (CSP)-extremals are most pleasant in the sense that 
the proofs are purely Lie algebraic. Therefore they apply to any group with 
Lie algebra s[(2). The arguments for (ALT)- and (SSP)-extremals (partly) 
make very explicit use of the underlying group. We start with the elimination 
of (U/2)-extremals. 

Proposition 7.1 For alia the "factorization" exp (2a k{P — Q)) is not 
optimal. In particular, optimal controls take values only in {±-P, ±Q, ±|T}, 
and optimal factorizations only use factors from exp(]RP U RQ U RT). 

Proof. W.l.o.g. let us consider a > small. With r = tan(a/2) and 
s = sin(a) we obtain 

1/n f COS a sina\ ^ ^ n\ ^o/ \ 

exp 2a-P-g = . ] =MirP,-sQ,rP) = A3{r,s,r). 

\ 2 / \^— sma cos ay 

The cost of the lefthand side is 2a, on the righthand side we have cost 
sin(a) +2tan(Q!/2). Consider the difference f{a) = 2a; — sin(a) — 2tan(a/2). 
Then a simple Taylor expansion yields 

= /(O) = /'(O) = r (0), and /"(O) = 1. 

Hence for small a > the factorization on the righthand side is better than 
the factorization on the lefthand side. □ 



Remark 7.2 For the Hyperbolic Reeds- Shepp- Car Problem the previous re- 
sult has an interesting consequence. It says that a "rotation on the spot" 
is never optimal. So if one wants to drive and return to the starting point 
but heading into a different direction, it is better to move forward (or back- 
ward)! This is in contrast to the euclidean case where it doesn't matter if 
one moves forward or turns on the spot: both paths have equal cost. Of 
course this is important, because a rotation on the spot is not feasible for 
the Reeds-Shepp-Problem, it is only feasible for the convexified problem! 

This property reflects the fact that in a euclidean triangle the sum of 
the angles equals vr whereas in the hyperbolic plane this sum is strictly less 
than TT. 
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7.1 Circular (CSP)-extremals 

Our next goal is to prove: 

Proposition 7.3 Optimal (CSP)- extremals have at most 4 factors. This is 
actually true for any Lie group G with Lie algebra sl{2). 

The proof of this result requires two different types of arguments. The 
first one is elementary and involves only the adjoint action. It is a matter of 
elementary computations to verify that 

g.adQ(_p) ^giAadP(^2g>)^ and 6^ ^ ^(-Q) = e^/'' ^ (r'P) , r ^ 0. 

Proposition 7.4 The factorization M (rP, —sQ, —rP) is not optimal for 
s e (l,-\/2) and r > 1. The factorization Wl{P,—Q,—P,tQ) is not opti- 
mal for t > 0. In particular, optimal {CSP) -extremals with switching time 
s > 1 have at most 4 factors. 

Proof. Let s e (1, -\/2) and r > 1, assume w.l.o.g. r < s. Then we 
obtain M{rQ, —sP, —rQ) = M (^P, r^s Q, —j.P) from the above equation. 
Comparing the factorization costs on both sides we obtain 

(s + 2r) - - ^^5^ = ^ (^^ - -rs)>0 if s e (1, \/2), r e (1, s). 

Thus M (rP, —sQ, —rP) is not optimal for s E (1, \/2) and r > 1. 
To prove the second claim we consider r = s = 1 and obtain 

M (p, -g, -p, tQ) = M (g, -p, -g, tg) = m (g, -p, -(i - t)g) . 

The lefthand side has cost 3s + 1 as opposed to 3s — t on the righthand side. 
Thus our claim is proved. □ 

While this was quite elementary, proving that optimal (CSP)-extremals 
with switching time s G (0, 1) have at most 4 factors requires some more 
sophisticated arguments. Therefore we first supply another, relatively simple 
argument, that allows to reduce the number of factors to 5: 

Proposition 7.5 For all s > the factorization M (sP, —sQ, — sP, sQ) is 
not optimal. 
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Proof. For s > let r = G (0, s). Then an elementary computation 
yields: 

M (rP, -sQ, -sP, rQ) = M (-rP, sQ, sP, -rQ) . 

Thus 

M{sP,-sQ,-sP,sQ) = M{{s -r)P,rP,-sQ,-sP,rQ,{s-r)Q) 

= M{{s-2r)P,-sQ,sP,{s-2r)Q). 

The cost of LHS and RHS are 4s, resp., 2s + 2|s — 2r|. As < r < s, we 
deduce \s — 2r| < s, i.e., RHS is better. □ 



Remark 7.6 The preceding argument has a nice geometric interpretation. 
Both factorizations (or arcs) actually generate a parallel translation along the 
geodesic perpendicular to the initial tangent. So initial and terminal tangent 
are both perpendicular to the geodesic joining the initial and terminal point. 




M{rP,-sQ,-sP,rQ) 

versus 
M{-rP,sQ,sP,-rQ). 



A kind of second order variational argument allows us to prove the fol- 
lowing stronger result: 

Proposition 7.7 Let G be an arbitrary Lie group with Lie algebra sl(2). 
Then for s G (0, 1) and t > the factorization M (sQ, — sP, — sQ, tP) is not 
optimal. 

Proof. Fix s G (0,1) and consider 7(t) = M{sQ, -sP, -sQ,tP). Let 
P : ^ G, P(x) = MixiP, X2Q, -X2P, -xsQ). Then 7(0) = P(0,s,s). 
We claim that the following statements hold true: 

(1) The differential dF{0,s,s) is invertible. Therefore there exist to > 
and a smooth (actually analytic) curve x: {—to, to) such that 

F{x{t)) = 7(t) for all |t| < to- 
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(2) We have Xk{t) > for i e (0, to),k^ 1, 2, 3. 

(3) Let 5{t) =3s + t- {xi{t) + 2x2(1) + X3{t)). 
Then = 5{0) = 5'{0), and 5"{0) > 0. 

Once we have proved (l)-(3) it is clear that M{sQ, —sP, —sQ,tP) cannot 
be optimal because for small t > the map F{x{t)) provides a better fac- 
torization. So let us prove (l)-(3). First we must compute the differential 
dF{x). We observe: 

dF{x)e2 = F{x)e'^^^e'^^^ {Q - P), 
dF(x)e3 = F{x){-Q) 

Thus dF{x) = F{x) e'^'^^e'^'^^ M{x2) with 
In terms of the basis {H,P,Q} we compute 



M(s)= 1 -1 s2 , detM(s) = -2s(s2 - 1). 




Since < s < 1, Claim (1) is proved. Next we use the Imphcit Function 
Theorem (IFT) to compute the derivate i;(0). Differentiating F{x{t)) — ^{t) 
we obtain dF{x{t))x{t) = ^{t) = (iA^(t)(l)P. Thus 

Since a;(0) = (0,s, s), we deduce x{Q) = M{s)~^e~'''^^e~''^^ P. Simplifica- 
tion with Mathematica yields: 

x{0) 



2(1 2' 2(-l + s2 

Since ^2(0) = 2:3(0) = s > 0, continuity of x{t) yields 2:2 (t) , 2:3 (t) > for 
sufficiently small t > 0. Now Xi{0) = and ii(0) > because of 1 — > 0, 
hence Xi{t) > for small positive t follows, too. Thus we may assume w.l.o.g. 
that Xk{t) > for t e (0, to), ^ = 1, 2, 3 which proves Claim (2). 

Verifying 5(0) = = ^'(0) is straight forward. Finally, we must use the 
(IFT) again to compute 5"(0) = -(xi(0) + 2x2(0) + ^3(0)). Differentiating 
the identity 

M{x2{t))x{t) = e-^2(*)^'^'^e-^3(*)^'^^P 
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we obtain 

M{x2)x = -X2M'{x2)x - X2e-''^^'^'^^ ad{P)e-'''^'^^'^^P 
Simplification with Mathematica yields 



x(0) 



' 4{-l+s2)2 * 

4 

V 4(-i+s2)2 y 



2(1 -s2) • 



For < s < 1 the denominator is positive, hence S"{0) > follows. This 
proves Claim (3) and finishes the proof of the proposition. □ 



Now we are ready to prove Proposition [7.3| . 
Proof. Consider a (CSP)-extremal with 5 factors. Up to symmetry (F) 
we may assume that the first factor is exp(rP). The switching pattern is 
clockwise or counterclockwise: 

m{rP,sQ,-sP,-sQ,tP), or M{rP, -sQ, -sP, sQ,tP) , 

with < r,t < s < a/2. Recalling that tan simply reverts the factors, we see 
that the second pattern is transformed into the first pattern. 

For s > 1 Proposition \7.4\ applies, showing that this arc contains a non- 
optimal subarc. For s G (0, 1) we apply the previous proposition and obtain 
that M {sQ, —sP, —sQ, tP) is not optimal. This finishes the proof. □ 

We conclude our discussion of (CSP)-extremals with one more observa- 
tion: 

Proposition 7.8 Let r > and s G (1, V2). Then M{rP, sQ, -sP) is not 
optimal. In particular, an optimal (CSP) -extremal with switching time > 1 
has at most 3 factors. Conversely, if C4:a{r, s,t) or C4c(r, is optimal, 
then s < 1 and r + 2s + t < 4. 

Proof. We compute M (rP, sQ, —sP) = M [xiQ, X2P, X3Q) with 



^2 

X2 = —{s + r(s^ — 1)), Xi = , and X3 = — . 

X2 X2 



For s > 1 and r > we have X2 < — 1, Xi > 0, and X3 < 0. Hence 
+ 1x2! -|- Ixsl = Xi — X2 — X3, and 

2s + r - xi -X2- X3) = \ \ 

For r > and s G (1, V2) the righthand side is positive, hence M (rP, sQ, —sP) 
is not optimal. □ 



36 



7.2 Alternating (ALT)-extremals 

While the results for optimal (CSP)-extremals are valid in any group G with 
Lie algebra 5l(2), some of the results for (ALT)-extremals depend explicitly 
on G. The crucial parameter is actually |^(G)| =: the cardinality of the 
center of G. We start with results that hold for any G. Whenever we make 
explicit use of G (resp. n) we will indicate this clearly. 

Proposition 7.9 For s e (0, 1) andr > the factorization M{rP, —sQ,sP) 

is not optimal. In particular, an optimal (ALT) - extremal with switching time 
s G (0,1) has at most 3 factors. Conversely, if A4:{r,s,t) is optimal, then 
s>l. 

Proof. We compute M {{rP, -sQ, sP)) = M {xiQ, X2P, X3Q) with 

rs 



^1 ^ n 2\' X2 = S + r{l-s), X3^ 



s + r(l — s^)' ' s + r(l — s^) 

For s G (0, 1) and r > we deduce X2 > and xi,X3 < 0, so the second 
factorization has cost —xi + X2 — X3. The cost difference is 

r + 2s- {-Xi +X2- X3) = — r-. 

s + r (1 — s^j 

For s G (0, 1) and r > numerator and denominator are both positive, so 
r + 2s — {—Xi + X2 — X3) > 0. Hence M (rP, —sQ, sP) is not optimal. □ 



Proposition 7.10 The factorizations 

M (P, -Q, P, -tQ) and M (^P, -V2Q, V2P, -V2Q, tP^ 

are not optimal for t > 0. In particular, an optimal {ALT) -extremal with 
s — 1 has at most 4 factors, and with s — y/2 it has at most 5 factors. 

Proof. Let w — y/2. An elementary computation yields 

M (P, -g, P) = (^_°^ , M (wP, -wQ, wP, -wQ) = - (^J 5 

Since au{U) = U and au{-l) = -1, we get M P, -Q) = M (P, -Q, P) 
and M {wP, —wQ, wP, —wQ) — M {—wQ, wP, —wQ, wP). Hence 

M{P,-Q,P,-tQ) = M{-Q,P,-{t + l)Q), 
M{wP,-wQ,wP,-wQ,tP) = M{-wQ,wP,-wQ,{t + w)P). 
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In each case LHS and RHS have equal cost, but for t > the righthand side 
is not an extremal, hence it cannot be optimal. We also observe that for all 
r G M the identities 

Mi{l ~r)P,-Q,P,-rQ) = 

M {{w - r)P, -wQ, wP, -wQ, rP) = - 

hold true. Therefore an optimal (ALT)-extremal with s = 1 and 4 factors 
may be replaced (at equal cost) by another extremal with 3 factors. Similarly, 
for switching time s = \/2 it suffices to consider at most 4 factors. □ 

The same technique as in the proof of Proposition can be used to 
obtain: 

Proposition 7.11 For 1 < s < -\/2 the factorization M.i—sQ,sP, —sQ,tP) 
is not optimal. In particular, optimal (ALT) -extremals with switching time 
s E (1, v^) have at most 4 factors. Conversely, if A5{r, s,t) is optimal, then 
s>V2. 

Proof. Let 7(t) = M{-sQ, sP, -sQ,tP) and consider F:R^ G, 
F{x) = M{xiP,-X2Q,X2P,-X3Q). Then 7(0) = F{0,s,s). We claim that 
dF{0, s, s) is invertible and that there exists a to > and a smooth curve 
x: (— to,io) ^ such that Xk(t) > for t G (0,to)5 k = 1,2,3, and for 
S(t) =3s + t- (xi(t) + 2x2it) + xsit)) we have = 5(0) = 5'(0), 5"(0) > 0. 
Computing the differential of F yields: 

dF{x) = F{x) e''''^^e-''^'^^M{x2) 

with 

M(x2) = [e"2"'^^P, -Q + P, e"^"'^^(-Q)] . 
In terms of the basis {if, P, Q} we obtain 

f-s -s\ 
M(s) = 1 1 s2 , detM(s) = -2s(s2 - 1). 
\-s^ -1 -1/ 

Differentiating F{x{t)) = 7(t) yields 

gX3adQg-X2adP^(^^^^)^ = P, M{x2)x = g^^ P^-xg ad (3p_ 
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For a:;(0) = (0, s, s) we obtain 

1^-2 + 2s2' 2' -2 + 252 

As s > 1, Xi(0) > 0. Therefore Xk{t) > for t > small follows. With 
6{t) = 3s + t — {xi + 2x2 + 2^3) we have 5(0) = = 5'(0). Differentiating once 
more we deduce 

M{x2)x = -X2M\x2)x+X2e''''^^ ad{P)e-''''^^P-X3e''^'^^e-''''^^ ad{Q)P. 
Mathematica yields: 

'4 -85^-35^ + 356 -(-2 + 5^)' (-2 + 5^)' (1-552 + 85^)' 
45 (-1 + 52)' ' 4^ ' 4 5 (-1 + 52)' 

^ 5(4-852 + 35^) ^ (5 (2-5^) (-2 + 3 5^)) 
^ ' 2-252 2 (-1+5) (1+5) ■ 

For 1 < 5 < V2 the numerator is positive and so is the denominator, hence 
5" (0) > as we claimed. This finishes the proof. □ 

Up to this point all statements hold true in any group with Lie alge- 
bra sl{2). The last proposition in this subsection makes very explicit use of 
SL(2). 

Proposition 7.12 Let s > \f2 and r,t > 0. If r + t > then the 




factorization M (rP, —sQ, sP, —tQ) is not optimal in SL(2). In particular, 

s > ^/S =^ M{~sQ, sP, —sQ) not optimal in SL(2), 

5 = V3, t > =^ M {-sQ, sP, -sQ, tP) not optimal m SL(2), 
se{^/2,^/3) =^ M{-sQ,sP,-sQ,sP) not optimal in SL (2) . 

In particular, optimal (ALT) -extremals in SL(2) have at most 5 factors, and 
if A5{r, 5, t) is optimal, then s G [\/2, \/3] 

25 

Proof. For 5 7^ ±1 let u = — , then 

5^ — 1 

M ii^P, -sQ, sP) = M{sQ, -sP, iiQ) . 

Now assume w.l.o.g. that r,t < ji. Then 

M{rP,-sQ,sP,-tQ) = M{-{l^-r)P,i2P,-sQ,sP,-i2Q,{n-t)Q) 

= Mi-{ii-r)P,sQ,-sP,{^-t)Q). 
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The LHS has cost 2s + r + t, the RHS has cost 2s + 2/i — (r + As r + t > /x, 
2s + r + t > 2s + fi > 2s + 2 fi — {r + t) , so LHS is not optimaL 

For s> VS we can apply the previous result with r = and t = s > ji. 
For s = we observe /i = s = a/3, and 

M (sP, -sg, sP, -tQ) = M (sg, -sP, sQ, -tQ) = M (sg, -sP, -(s - t)g) . 

Comparing costs we obtain 3s + t (LHS) and 3s — t (RHS), so LHS is not 
optimal for t > 0. 

Finally, for s G (\/2, -\/3) we have s^ — 1 > 1, hence 2s > -^zi- Taking 
r = t = s we therefore have r + 1 = 2s > 2fi. Hence M (sP, —sQ, sP, —sQ) is 
not optimal in SL(2), whence optimal (ALT)-extremals cannot have 4 + 2 = 6 
factors. □ 

The following picture in the hyperbolic plane shows geometrically where the 
mysterious identity comes from. 



M (/iP, -sQ, sP) = M (sg, -sP, fiQ). 



It is noteworthy to mention that the bound on the number of factors of 
optimal (ALT)-extremals always depends on the group. In particular we 
woul like to stress that for the simply connected group SL(2,]R) there is no 
a priori bound on the number of factors. 

7.3 Singular (SSP)-extremals 

There are two reasons for non-optimality of (SSP)-extremals. One of them 
is purely local, the other one is global in nature. Looking at the graph from 
Figure ^ we can say this as follows: For (SSP)-extremals that switch between 
top and bottom line (of the graph) we get a bound on the number of factors 
using purely local arguments. An (SSP)-extremal that does not switch be- 
tween top and bottom basically looks like an (ALT)-extremal with switching 
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time 2-\/2 plus an interspersed yS-arc. A global argument (depending explic- 
itly on the group) is indispensable to bound the number of factors for these 
(SSP)-extremals. 

To save some typing throughout this subsection we let w = \pl. We recall 
that this is the "switching" time of the bang-arcs of an (SSP)-extremal. The 
quotation marks indicate that a switch might be virtual because the singular 
arc between two identical bang-arcs may have zero length. 

Proposition 7.13 Let w = \f2. Then the following identities hold true in 
SL(2); 

m(^sP,-'^Q^ = -M0g,-sP^, ifs^O, (10) 

M{wP,-wQ) = -M{wQ,-wP), (11) 
MisT,wP-tvQ) = M{wP,-wQ,-sT), s G R, (12) 

(2 T XL) \ 
Q, -{r + w)P, Q ,r e M^13) 
r -\-w r -\-w J 

M{wP,wQ,-wP) = m(^-Q, -2wP, --Qj . (14) 

The proof consists of nothing but elementary computations. The next 
result is valid for any group. Basically it means that the crucial (SSP)- 
extremals are those that look like (ALT)-extremals with switching time 2y/2 
with an interspersed ^-arc. 

Proposition 7.14 The factorization M.{wP,wQ, —wP) is not optimal in 
any group. In particular, if an optimal (SSP)- extremal contains 5 B-arcs, 
then for each S-arc the two adjacent B-arcs must be equal. 

Proof. In view of the last item of the previous proposition we observe 
that 

/UJ w \ 

M (wP, wQ, -wP) = M y-Q, -2wP, -^Q) ■ 

Both factorizations have equal cost 3w, but the righthand side cannot be 
optimal because it does not come from an extremal! Indeed, (ALT)-cxtremals 
are impossible because the switching pattern is circular. (CSP)-extremals are 
impossible because they have switching time s e (0, \/2) whereas here the 
switching time is 2^/2. Finally, (SSP)-extremals are impossible, too, because 
for an (SSP)-extremal with middle arc exp(— 2wP) the third arc would have 
to be exp(i(5) with t>0. □ 

A similar argument allows to prove the following stronger result: 
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Proposition 7.15 For r,s > the factorization WllrP, sT,wQ, —wP) is 
not optimal. In particular, an optimal (SSP) -extremal that switches between 
top and bottom has at most 5 factors. 



Proof. Let r,s > 0. Using Proposition [7.13| ( p!2D and (|T3|) we obtain 



(2 T vo \ 
Q, -(r + w)P, g, -sT . 
r + w r + w J 

Both sides have equal cost r + s + 2w, but the righthand side is not an 
extremal if r > and s > 0, hence it cannot be optimal. □ 

One more argument is needed. 

Proposition 7.16 In SL(2) M (rP, sT, wP, —wQ, —wQ, wP) is not optimal. 
In particular, optimal {SSP)- extremals in SL(2) have at most 7 factors. 

Proof. Due to Eqn. ([T8|) we may shift the 5'-arc to the right end, and it 
suffices to show that M {rP, wP, —wQ, —wQ, wP) is not optimal for r > 0. 
Next we use the seemingly obscure Eqn. (|T0|) whose geometric meaning 



becomes clear in the discussion of (HRSCP) (cf. proof of Proposition p3.6D 
namely 

sP,--q] = -m(-Q,-sP 



which is valid for all s ^ 0. We consider s = r + w > w. Then - < — = w. 



Let e = w ^ > 0. Then we obtain: 

w-\-r 



((r + w)P, -wQ, ~wQ, wP) = M Q, -{r + w)P, -eQ, -wP, wQ 



w 



Both factorizations have equal cost. But the second factorization cannot be 
optimal because it does not come from an extremal! Indeed, the switch- 
ing pattern is neither alternating nor circular. Besides the switching times 
r + w,e,w are not equal. Hence M {{r + w)P, —wQ, —wQ, wP) cannot be 
optimal in SL(2). An (SSP)-extremal with 8 factors will always contain a 
subarc of the above (non-optimal) form. □ 

Our last task in this subsection is to show that the previous propositions 
imply that the singular extremals listed in Table are sufficient. Up to five 
factors we obtain the factorization maps S?>P, SAQ, SAP, SAQ, S5P, S5Q, and 



5*50. Due to Proposition |7.15| we know that optimal (SSP)-extremals with 



more than five factors look like (ALT)-extremals with switching time 2 a/2 
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and an interspersed S'-arc. In view of Eqn. (|12|) the S-aic may be the second 
or third arc in the product. But if the number of factors is even, we may 
revert the order of the factors (i.e., apply tern), if necessary, so that we 
can also assume that the S'-arc is the second arc. Therefore it suffices to 
consider a single factorization map (say, 5*6, 58, . . . ) for an even number of 
factors, but it is necessary to distinguish between, say, S7a = M {rP, | T, . . .) 
and S7b = M {—rQ, wP, | T, . . .) , resp. S9a, S9b, . . . , for an odd number of 
factors. As STa-extremals are not optimal in SL(2), this naming convention 
may look surprising, but it perfectly makes sense if one also wants to consider 
the simply connected group SL(2, M). Since S'8-extremals cannot be optimal, 
we see that the family J-' given in Table |l] exhausts all possibilities. Thus the 
proof of Theorem ^]T| is finished. 

8 Conclusion 

Having proved that the families T^T\^T2 are sufficient, it is natural to ask 
if they are minimal with this property. The answer is affirmative in the sense 
that for every / G there exist (r, G dom(/) such that /(r, is op- 
timal, and no / G T \ j allows us to reach the same endpoint at equal 
cost. Rather than providing a list of such cases, we include pictures of 
metric spheres S(c) = {(7 : Tig) = c} for some values of c, cf. Fig. |^. To 
generate these pictures we used the parametrization of SL(2), resp., its sim- 
ply connected covering group SL(2,R) described in Appendix A. To under- 
stand the pictures it suffices to know that SL(2) is identified as a set with 
WH + MT + (— vr, 7r]f/ C s[(2) = R'^. In this parametrization the symmetries 
aH,crT,cru are 180-degree rotations around the H-, resp., T, ?7-axes. Inver- 
sion is simply, i{X) = —X, and tan is reflection in the Tt/-plane. The 
horizontal plane Mif -|- MT corresponds to symmetric, positive deflnite ele- 
ments of SL(2) while the vertical segment {—it, n]U corresponds to the circle 
group exp(Rf/) = exp([-7r, 7r]f/) = S0(2). 

For c = 1 the relevant maps are A3, C3, CAa, CAc, S3P, and S3Q. The 
four y43-patches make up the top and bottom part. The thin sides with the 
flgure eight curves consist of S3P- and S'3Q-patches, the S'3P-patch is inside 
the flgure eight. The "flat" sides consist of eight patches: four C3- and two 
CAa- resp., C4c-patches. The C4a-patches connect (horizontally) to an S3Q- 
patch (because C4a(0, s, 0) = M {—sQ, —sP)) whereas the C4c-patches con- 
nect (vertically) to y43-patches (because of C4c(0, s, 0) = M {sQ, —sP)). One 
can see that the C3- and 5'3-patches fit in perfectly while the ^43- and C4- 
patches have nontrivial intersections. 

The shape of the sphere §(1) still reminds us of the fact that the gener- 
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ating set lA = conv(±P, ±Q) is a flat square. 

For c = 2 we also have SAP- and S'4Q-patclies emanating from the S3- 
patches, and connecting them to the ^43- and C3-patches, respectively. One 
can also see how larger portions of C4a- and C4c-patches intersect. 

For c = 3 tiny triangular patches appear, the S5P- and SSQ-patches. 
One also sees how A4-extremals arise on the top surface. The S'5a-patch is 
also a tiny triangle adjacent to the 5'4P-patches. 

For c = 4 the C4-patches have completely disappeared (cf. Proposi- 
tion |7l8|) . The only change compared to S(3) is, of course, that the S5- and 
A4-patches are much larger, hence better visible. 

The last two pictures show S(c) for especially critical values of c. For 
c = 3v^ the sphere touches the plane M.H + MT + nil for the first time, 
A3-extremals make up only a very thin portion of the top while A4- and 
A5-patches are clearly visible. Nevertheless the AS-patches are quite small 
which means that the set of matrices for which A5{r,s,t) is the optimal 
factorization is quite small and close to —1 G SL(2). 

For c = Y a/2 (ALT)-extremals have disappeared (which means they 
are above ttU) and all patches come from (SSP)-extremals. Except for the 
S4Q- and S'SQ-patches there are no further intersections. So in particular, 
S3P, SAP, S5P, S5a, S6, STfo-patches perfectly fit together, only the portions 
where the vertical coordinate exceeds vr are chopped off in SL[2). They 
would still be present in SL(2,M), though. 

Finally these pictures also make clear that the set Fix(?o"T) ^ SL(2) is a 
cut locus for the problem. In the pictures Fix{taT) is the plane M.H + M.U. 
An extremal that hits this plane transversally will lose its global optimal- 
ity. A glance at Table reveals that this applies to A3, A5,C4:a, SAQ, S5Q- 
extremals; it does not apply to S5a, S7b, C4c-extremals because the latter 
are zaT-invariant. 

Before we give an outlook on generalizations and future work we now give 
a proof for Theorem |1.3| . 

Proposition 8.1 Let S\ S"^ e s[(2) with det{S^) = det{S^) = 0, and 
assume that [S^jS"^] ^ 0. Let U = comr{±S^,±S^). Then there exist 
/i > and a qq E SL(2) such that U = fiAd{go)U. In particular, we have 
Tu(9) = ^"%(^o ^^^o) for all g e SL(2). 

Proof. We recall that {X G s[(2) : det(X) = 0,X ^ 0} is the boundary 
of a Lorentzian double cone CU—C (minus the vertex), cf. Figure |l|. W.l.o.g. 
we may assume that S^, S"^ lie on the upper part, say C. Finding /i and Qq 
is a 3-step procedure: 
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1. Rotation: There exist u G [0,7r] and pi > sucli that e""""^^ = piP. 
Let gi = exp{uU). 

2. Shearing: We first observe that (in ifTf/-coordinates:) 

Since (1 + t^)~^ (2r, 1 — r^) parametrizes the unit circle (except for the 
point (0,1)), we see that {Q,oo) e^'^^^\-Q) = C \ (0, cx))P. Hence for 
X = Ad(^i)^2 pg^n r* e M and p2 > such that e^^'^'^^^^X = -pa Q- 
We let = exp(r*P) and observe that Ad{g2)P = P- 

3. Hyberbolic rotation: Let h = ilog(p2/pi) ^ IR, so e^^pi = e~'^^p2 = 
^plp^ =: p > 0. Then e'^'^^^'^piP = pP, and e'^^'^(^)(-p2Q) = -pQ. Let 
5f3 = exp{hH). Then Ad(5(35(25'i)S'^ = pP and Ad{g3g2gi)S'^ = -pQ. 

Finally let go = {g-i92gi)^^ ■ Then U = fi Ad((7o)W. Hence 

T^{g) = Tf,Ad{go)u{g) = -'^Ad{9o)U = -^(igo^(^)) = -%{9o^9 9o)- 

1^ 1^ 1^ 



So the claim of Theorem follows with a = I^^ and A = p ^ . □ 

Thus all claims made in the introduction have been proved by now. For 
an arbitrary 2-slip system with symmetric slip rates we now know how to 
find the dissipation distance and geodesies. 



Generalizations and future work 

It is clear that two different types of generalizations are of interest, namely 
more slip systems and passage to dimension d = 3. In a forthcoming paper 
we will treat the 2P)-hexagonal lattice, i.e., we consider shps along the sides 
of an equilateral triangle. This is of practical interest because such systems 
arise in reality. 

Observing that in the problem analyzed in this paper the optimal con- 
trols are piecewise constant, it is natural to ask whether this is true for 
general slip systems, or, for general polytopes, i.e., U = conv(Xi, . . . ,Xm) 
for Xi, . . . , Xm G 5l{d). The following example shows that the answer is 
negative for arbitrary polytopes. 

Example 8.2 In s[(2) let Xi = H + 2Q, X2 = H - 2Q, X3 = -H + 2P, 
X4 = —H — 2P, and U = conv(Xi, . . . , X4). Then W is a simplex and G 
intW. For the polar Q = {p : 'H{p) > —1} we obtain Q = conv {p^ , p"^ , , p^) 
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with pi = (1,1, -1), p2 = (1, _1, 1), p3 ^ 1^ p4 ^ Now 

consider p{t) G conv(pi,p^), then (PMP:2) imphes u{t) G conv(X3,X4) = 
-H + [-2, 2]P. Let P) = R^T + MP denote the subalgebra generated by 

X3, X4. The associated subgroup is (exp ^) ~ | ^ g J^i ^ '■ a > 0, 6 G r|. 

Now ((e'^'i'')*) leaves the hue aS{p^,p^) = (1,0,0) + M(0, 1, -1) invariant. 
Take u{t) = —H + v{t)P with measurable v. [0, 00) [—2,2], g{t) the 
corresponding trajectory with g{Qi) = 1, and p(t) = Ad{g{t))*^{p^ 
Then g{t) G (expt)) and p{t) G aff(pi,p^) for all t. By continuity, there 
exists t* > such that p(t) G conv(pi,p^) for t G [0,t*]. Hence {g,p,u) is an 
extremal in [0, t*]. 

The geometric reason for the occurence of this degeneracy in the previous 
example is that the edge conv(pi,p^) lies in a single coadjoint orbit — the 
one-sheeted hyperboloid {p : C{p) = —1}. Considering the dual picture, we 
already observed that the opposed edge conv(X3,X4) lies in a 2-dimensional 
subalgebra of 51(2). This is a very special situation that will never occur for 
slip systems because the 2Z}-subalgebras in sl{2) are precisely the hyperplanes 
tangent to the double cone. Therefore we state: 

Conjecture 8.3 Let U = conv(Xi, . . . , X„) C s[(2) with det{Xj) = 0. 
Then optimal controls are piecewise constant. 

A careful look at our analysis of the flow rule actually suggests a much 
stronger conjecture. For X G s[(2) \ {0} with det(X) = 0, the induced flow 
{p,t) I— pe^^^^^^ on the opposed face op(X) = {p : {p,X) = —1} is (up to 
symmetry) the same flow as the one we analyzed on p.p2|f. All flow lines are 
parabolas, like in Figure Consequently, an afline line, say i C op(X) will 
be tangent to at most one of these. This suggests that we get at most one 
singular control for each edge of Q. At a vertex, say p, of Q the stabilizer 
is one-dimensional. Hence gp flW fl {X : {p, X) = —1} is singleton, or empty. 
Since the polytopes U and Q are (combinatorial) duals of each other, we can 
formulate the following 

Conjecture 8.4 Consider 2-dimensional slip systems, so C s[(2) is a 
convex polytope, U = conv^S^ , . . . , S"^) with det(S'") = 0. Assume that 
G int(W). Let /o,/i,/2 denote the number of vertices, edges, and faces 
of the polytope U. Then optimal controls are piecewise constant, and the 
number of possible values is at most /o + /i + /2- 

In other words, for each vertex of Q (resp., face of U) and each edge of U, 
we get at most one singular control. In order to obtain rigorous proofs for 
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these conjectures, Proposition |8.1| and our discussion of the flow rule for 
the square lattice are the appropriate tools (cf. propositions and |5.8|) . 
For if we consider an arbitrary (but fixed) edge of Q, we see that up to 
automorphism, we may assume that the two active slip-systems are either 
—P, Q, or — P, —Q, as in Proposition 5.7 and 5.8. Hence the arguments given 



there apply, whence switches must occur except for some isolated points. 

Formulating reasonable conjectures for 3D-slip systems seems to be more 
difficult. Of course one can hope that optimal controls must be piecewise 
constant, but we do not have such strong evidence as in the 2D-case. In 
particular it is not yet clear, what types of singular controls one has to 
expect. 



Computational tools 

Programming and debugging are usually time consuming tasks, therefore we 
will make some tools available on the web. In particular, we will provide 
Mathematica notebooks with procedures for finding the factorization cost 
and optimal factorizations in SL(2). The pictures of the metric spheres were 
generated with Maple. But since the numerical computations are too slow, 
a more efficient C-program produces a file defining the necessary plot data 
structures. An interested reader should follow the links starting at URL 
http: //www.mathematik.uni-stuttgart . de/mathA/lstl/mittenhuber/ 



Appendix A: The group SL(2,M) 

In this ^appendix we provide a summarj^of facts about the simply connected 
group SL(2,M). It is well-known that SL(2,R) cannot be represented (faith- 
fully) as a matrix group (subgroup of GL(n) for some n G N). When analyz- 
ing SL(2) and related groups the following functions are ubiquitous: 



cosh(yi), if > 0, 

^ii^, ifz>0. 



n=0 ^ ^ I if Z < 0. 
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The ubiquity of these functions is due to the fact that C and S describe the 
(matrix) exponential function expgL(2) • 

s[(2) ^ SL(2): 

VttX'^ = C(-det(X))id+^(-det(X))X for every X G 51(2). 

fc=0 

Due to a lack of letters we let T(z) = assuming that the function T(-) 
will not be confused with the matrix T = P + Q. 

The covering map and a local inverse 

As a set we identify SL(2,]R) with the Lie algebra sl(2). Using the basis 
{H, T, U} and writing X E s[(2) as X = hH + tT + uU with h,t,uE M, the 
covering map /: sl{2) SL(2) is defined as 

fihH + tT + uU) = Cih^ + t^)(''''^';\ + \)- 

^ / \ / sm(u) cos{uj J ^ ' \ t —hj 

(15) 

Hilgert and Hofmann introduced this map in ^ to parametrize the group 
SL(2,M). Since / is an analytic covering, it allows to define a group opera- 
tion o: s[(2) X 5[(2) s[(2) such that 

f{XoY) = f{X) f{Y) forallX,y es[(2). 

From now on we will always identify the group SL(2, M) with (s[(2), o). 

The identity /(X + 2kTiU) = /(X) for X E sl{2), k E Z is obvious. 
Conversely, for g E SL(2) the preimage f-\g) has the form X + 27iZU 
for some suitable X. Next let S = M.H + MT denote the set of symmetric 
matrices in sl{2). Then the restriction f : £ + (— vr, tt] [/ — > SL(2) is injective. 
An inverse of this restricition is easily obtained. For 
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X = hH + tT + uU with 



u = arg((a + (i) + z(6 — c)) G (— TT, tt], 

f h\ I f a-d\ ^ ^( Ua + dY + {h-cY\ 

[t) = ^U + J' l,V 4 )■ 

We denote this local inverse map simply /^^ : SL(2) £ + (— 7r,7r]f/. 
Practically this means that we obtain the group SL(2) from SL(2,M) simply 
by taking the ^/-coordinate mod 27r. 
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Advantages and disadvantages 

This parametrization of SL(2,]R) has advantages and disadvantages. The 
main disadvantage is that the group operation is much more comphcated 
than ordinary matrix multiphcation. We will give an explicit expression for 
X o Y, but this expression is practical mainly for numerical computations. 
The main advantages, on the other hand, are the rotational symmetry 
and the fact that every one-parameter subgroup exp(RX) C SL(2,R) lies in 
RX + M[/ C SL(2,M). 

A particular advantage for the problem (OCP) is that in the above 
parametrization all maps from the symmetry group F are simply linear maps 
on the vector space s[(2). For X = hH + tT + uU one quickly verifies that: 

/(-X) = f{X)-\ 

f{hH-tT-uU) = anUXX)), 

f^^hH + tT-uU) = arifiX)), 

f^-hH-tT + uU) = auifiX)). 

The first equation shows that inversion in SL(2,R) is simply T(X) = —X. 
Similarly, the second equation implies that an '■= {X ^ HXH) : SL(2, R) 
SL(2, M) is an automorphism of SL(2, R). So the 180-degree rotations around 
the H,T,U-a.xes, respectively, are all group automorphisms. 

Group multiplication 

Multiplication and conjugation with elements from M.U is easily obtained — 
this is the rotational symmetry we already mentioned. 

{uU)oX = e^^'^^X + uU, 
Xo{uU) = e-^'^^X + uU, 
{uU)oXo{-uU) = e'^'^^X, 

for all X G s[(2), m G R. In particular, 

(knU) o X = X o (knU), {2k'nU) o X = X + 2kTxU, for all keZ. 

In order to find Xi o X2 for abitrary Xi,X2 G s[(2), we use the observation 
that £ o £ (Z £ X (— 7r/2, 7r/2), cf. [|l^. Lemma 1]. Hence 

Xi,X2G^ =^ X,oX2 = f-\f{X,)f{X2)) 
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Writing Xi = hiH + tiT + UiU , i = 1,2, and observing {—uiU) o Xi,X2 o 
{—U2U) G S, we obtain 

X10X2 = Kf/) o (-mf/) 0X1 0X2 o (-uaf/) o (uaf/) 

= (Mif/) o ( {{-UiU) o Xi) o (X2 o (-usf/)) ) o {U2U) 

as as 
= {u,u) o /-I (/(-mf/) /(Xi) /(X2) /(-«2f/)) O (^i2f/). 

The last expression is as explicit as can be, but of course its usefulness is 
mainly restricted to numerical computations. 

One- parameter groups 

Let exp: sl{2) SL(2,M) denote the exponential function (of the group 
SL(2, M)). Our first observation is that the covering map f{X) coincides with 
the matrix exponential function if X is either symmetric or skew-symmetric. 
Hence 

exp(X) = X, and exp(MX) = MX for all X e £ U M.U. 

It turns out that exp(MX) C MX + holds true for all X G s[(2). More 
precisely, if Xq = hoH + uqU , Xq ^ 0, then (cf. [0) 

exp(MXo) = {hH + uU : Uota.nh.{h) = hQsm{u)} . 

Qualitatively there are two different cases (for Xq 7^ 0): 

exp(MXo) = <uU + artanh ( — sin(M) ) H : u G m1 , if I-UqI > l^ol, 
I \uo J J 



exp(MXo) = l^hH + arcsin (^tanh{h)^ U : h 



if \uo\ < \h, 



o| 



The other one-parameter groups are obtained via rotation around the [/-axis. 
Figure ^ shows one-parameter groups in the iff/-plane. The group SL(2) is 
obtained by reading the picture modulo 27iU. The dashed curves indicate 
the boundary of the complement of the image of the exponential function. 



An explicit expression for exp : sl{2) SL(2, M) is available, too, cf. [jTO|, [12 
Recalling that we defined 



z = Q, 
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-271 




Figure 8: One parameter groups in the plane Mif + MJJ 



we observe that 



dom(r) = M \ 



: n &N, n odd 



Now let X = pqE + uqU G sl(2) with E = cos a if + sinaT G S, and let 
k{X) = pI- ul- Then exp(X) = pE + uU with 



p = arsinh(po 

arctan(-uo^(^(-^))), 
sign(uo)A/^A?(X), 

arctan(Mor(A;(X))) + sign('Uo) 



u 



1 , V-HX) _ 



k{X) > 0, 
k{X) ^dom(T), 

otherwise. 



Special cases of particular interest are 



exp(tP) = exp(^(T + f/) 



exp(tQ) = exp(-(T-t/) 



arsinh - T + arctan \ - ] U. 
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Appendix B: The hyperbolic Reeds-Shepp-Car 



As we already mentioned (cf. pJT^) the Hyperbolic Dubins Problem (HDP) 
can be considered as an optimal control problem on PSL(2, R). Hence solving 
(HRSCP) is equivalent to finding time-optimal paths of 

= 7(t)u(t), ue± conv(P, Q) a.e., 7 G PSL(2). (HRSCP) 

At this point it is not clear, if optimal arcs exist for (HRSCP). We must con- 
vexify the set of admissible control values, i.e., we must pass to conv(±P, ±Q). 
The only difference between this Convexified Hyperbolic Reeds-Shepp-Car 
Problem (CHRSCP) and the problem (OCP) is that the first one evolves on 
PSL(2) and the second one on SL(2). From the characterization of extremals 
and the elimination of (U/2)-extremals we deduce that (HRSCP) is indeed 
solvable. 

By definition, PSL(2) = SL(2)/{±1}. Since the adjoint representation 
Ad has kernel ker Ad = {±1}, we can write Ad: SL(2) — >■ PSL(2) to denote 
the quotient map. 

Proposition B.5 LetTpsh{2) denote the factorization cost in PSL(2). Then 
TpsL(2)(Ad((7)) = min{T{g),T{-g)} for all g G SL(2). 

This is due to the fact that for a trajectory •yit) in SL(2) its projection 
Ad(7(t)) is a trajectory (for the same control u{t)) in PSL(2). So "<" follows 
immediately. Conversely, if ri{t) G PSL(2) is an optimal path from //(O) = 
Ad(l) to r]{t*) = Ad{g), then r]{t) lifts to a trajectory -f{t) G SL(2). Since 
ri{t*) = Ad{g), -f{t*) G {±g}, hence ">" follows, too. 

A practical consequence of this observation is that the sufficient families 
listed in Table [I| are also sufficient for PSL(2). In particular, there always 
exists an optimal path in with at most 6 factors, resp. 5 switches. Of 
course our goal is to show that we can drop some maps from Table |T], so we 
find smaller sufficient families for PSL(2). Briefiy, we will show that one may 
drop A5, S5a, S6, S7b from JF. So optimal paths for (HRSCP) need at most 
5 pieces, resp. 4 switches. 

Another important observation is that inversion of the factorization maps 
is much nicer in SL(2) because exp(tP) = id +tP is linear in t whereas 
Ad(exptP) = e^'^'^^ is quadratic in t. Thus finding optimal paths for given 
boundary data is preferably done working in SL(2). 
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M(fP,-sQ) =M(sQ,-f P) 




The missing comparison arguments 

We need two new arguments, one for (ALT)- and one for (SSP)-extremals. 
The first argument also shows where the, perhaps obscure, identity used in 
the proof of Proposition |7.16| came from: 

Proposition B.6 Let s > 1 and r,t > such that r + t > |. Then the 
factorization M (rP, —sQ,tP) is not optimal in PSL(2). In particular, 

s > I =^ M (sP, -sQ, sP) not optimal in PSL(2), 

s > V2 =^ M{sP,-sQ) not optimal in PSL{2) , 

r>0 =^ u{rP,-\/2Q,y/2P) not optimal m FSL{2) . 

Furthermore, optimal (ALT) -extremals in PSL(2) have at most 4 factors, 
and if A4:{r, s,t) is optimal in PSL(2), then s E [1, V2]. 

Proof. Let = 2/s. We already observed that 

MsL(2) (/iP, -sQ) = -MsL(2) {sQ, -/iP) , 

hence M (/iP, -sQ) = M(sQ,-^P) in PSL(2). Thus M{rP,-sQ,tP) = 
'M.{—{fi~r)P,sQ,—{fi — t)P). Comparing costs we find r + s + t for LHS 
and s + 2iJ,-{r+t) for RHS. If r+t > /i, then s+r+t > > s + 2/i-(r+t). 
So RHS is better than LHS. 

For s > 1 we choose r = t = s and obtain r + t = 2s>2>-. For s > \f2 

s * 

we choose r = 0, t = s, and obtain r + 1 = s > |. Finally, s = \/2 = t and 
r > also implies r + t > |. □ 

The second argument is similar, it eliminates S'5a-extremals. 

Proposition B.7 For r > the factorization M {rP, a/2P, —\/2Q) is not 
optimal in PSL(2). In particular, S5a{r,s,t) is not optimal for r > 0, and 
optimal (SSP)-extremals in PSL(2) have at most 5 factors. 



54 



Proof. Let w = v2 and s = r + w > w. Then - < — = w. Let e = 

* s w 

w — > 0. In the previous proof we aheady observed that M [sP, — f Q) = 
M (f Q, -sP) holds true in PSL(2). Hence 

MirP,wP,-wQ) = MisP,-wQ) 
= m(^Q,-sP, 

All these factorizations have equal cost because ofs + ^ + e = r + 2w. But 
the last one cannot be optimal because it does not come from an extremal. 
Indeed, the switching pattern Q h —P h —Q is circular, but the switching 
time is s = r + \/2 > \pi. And we know that (CSP)-extremals have switching 
time less than The second claim is obvious because of 5'5a(r, s, t) = 
M.{rP,wP, —wQ, —sT, —tQ). Finally, if an (SSP)-extremal has six factors, 
then it contains at most two singular arcs. Therefore one can always find a 
subarc like M (rP, wP, -wQ), or M (rP, sT, wQ, -wP). □ 



sP, — Q, -eQ 
s 



-eQ 



Comparison with the eucUdean case 

The hyperbolic and the euclidean problem bear similarities, but they also 
differ in some respect. The euclidean case has a degeneracy which refiects 
the geometric fact that the sum of the angles in a triangle equals vr in eu- 
clidean geometry. In the hyperbolic case the sum is strictly less than vr, this 
basically accounts for the fact that we could eliminate the (U/2)-extremals 
by hand. Consequently, an optimal path for (CHRSCP) is also admissible 
for (HRSCP), and (HRSCP) always has a solution (for all possible boundary 
data). 

Table 1^ shows a sufficient family for (HRSCP), the symmetry groups are 
the same as in Table so we do not list the groups here. Instead, the last 
column contains a reference to [|18], namely where the corresponding paths 
appear, resp. where they are eliminated. 

It is interesting to observe that the hyperbolic case in some sense swaps 
the role of the two types of paths corresponding to S5P- and SSa-factorizations. 
In S5a is not optimal and S5P yields some optimal arcs whereas in the 
euclidean setting the paths corresponding to S5P are not optimal (cf. |[I8| , 
Lemma 12, Fig. 9]) but the paths corresponding to S5a belong to the sufficent 
family ( |[T8| , Theorem 8, Item 7]). 
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Type 


Map 


Domain 


Remark 


M (bussmann/ iangj 


ALT 


A3 
AA 


s e [1V2] 


2 cusps 
new 


degenerate case 
(LLTV) 
not needed 


CSP 


C3 






[8.3] 


SSP 


CAa 
CAc 




2 cusps 
1 cusp 


[8.4] 


S3P 
S3Q 






[8.2] 


SAP 
SAQ 






[8.6] 


S5P 




new 


not optimal in 

cf. Lemma 12, Fig. 9 


S5Q 






[8.5] 




S5a 




not optimal in 


[8.7] 



Table 2: A sufficent family for PSL(2) 



Visualization 

The problem (HRSCP) is also useful for visualizaton purposes. Every tra- 
jectory ■j{t) e PSL(2) (or SL(2)) yields a path ({t) in the open unit disc D. 
One may assume w.l.o.g. that the controls P and Q correspond to forward 
left- and right-turns, while | (P + Q) corresponds to a geodesic arc. 

Some obscure identities and constants suddenly get a clear geometric 
interpretation. For example, consider the singular switching time w = \pl 
and the identities 

m{s{r,wP,-wQ,-S2T) = Mi{si + S2)T,wP,-wQ) , . 

= M(wP,-wg,-(si + S2)T). 

Computational verification is a tedious exercise that gives no insight at all 
why these identities hold true. Now consider the path, say C^ii) G D, corre- 
sponding to M {voP, —wQ). Let Co? ^o? Ci? ^1 denote the boundary data (posi- 
tions and tangents. Assume w.l.o.g. that Co = and Vq = 1. Then Ci G (0, 1) 
(actually: Ci = | \/2); and vi = —1. So ({t) is tangent to the geodesic 
through Co, Ci; cf. Figure ||. So geometrically it is clear why these identities 
hold. We start and finish on the same geodesic, but with reversed orientation 
(forward/backward motion). The M {wP, — wQ)-maneuver accomplishes the 
U-turn part, and it does not matter, if we perform this turn at the begin- 
ning, in the end, or somewhere inbetween. If we reflect Figure ^ along the 
horizontal axis, then it becomes obvious that we could have performed the 
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Figure 9: The geometric meaning of the singular switching time 

turning maneuver starting with a right turn (instead of a left turn). Hence 
we see that M{wP,-wQ) = M{wQ,-wP) holds true in PSL(2). These 
observations are actually in full analogy to the euclidean case, cf. [p!8| , Fig. 18, 
p. 59]. It also becomes evident that for (SSP)-extremals the path ({t) will 
never intersect itself, so C(^i) = C(^2) iff = t2 because ((t) will consist of 
subarcs of a fixed geodesic and interspersed turning maneuvers. W.l.o.g. one 
may assume that this geodesic is the diameter (—1, 1) as in Fig. ^, and then 
it becomes obvious that ({t) is doublepoint free. 

Proposition B.8 For (SSP) -extremals and {CSP)- extremals with < 4: fac- 
tors the path ({t) has no self-intersections {i.e. C{ti) = C(^2) implies ti = ^2)- 

Instead of a rigorous proof (which would not be elucidating at all) we sim- 
ply provide pictures showing why neither C4a-extremals (2 cusps) nor C4c- 
extremals (1 cusp) have self-intersections. 




C4a-extremals C4c-extremals 
M (rP, -sQ, -sP, tQ) M (rP, sQ, -sP, -tQ) 

Figure 10: CAa- and C4c-extremals have no self-intersections 

The reason why we look at self-intersections is that the corresponding 
subpath is nothing but a turning maneuver (same initial and terminal point) 
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by some angle a. Since the group exp(Mf/) is the stabihzer of the point G D, 
we see that it is impossible to reach a nontrivial rotation exp(Q;f/) from the 
identity 1 along an extremal of type (SSP), (C4a), or (C4c). 

Proposition B.9 For a G (0,7r/2) let s{a) = sin(a;) and r{a) = tan(a/2). 
Then M.{rP, —sQ,rP) = exp{aU) is optimal in PSL(2) as well as in any 
other group. 

Proof. We know that g := exp{aU) has an optimal factorization. Since 
we have a sufficient family, the previous proposition implies that the best fac- 
torization is of type A3 or A4. In view of Table |I] we must solve A3{r, s, t) = 
a{g) and A4:{r,s,t) = a{g) for all a G F. Since au{g) = g, we only need 
to consider two different righthand sides, namely g and <7T{g) = exp(— af/). 
We already observed A3{r{a), s{a),r{a)) = g, whereas <JT{g) = A3{r,s,t) 
implies s = — sin a < 0, so this solution is not feasible (cf. Table |I]). In SL(2) 
the equation A4:{r,s,t) = exp{±aU) leads to: 

A * r + s — rs'^\ _ f cos(a;) ±sin(a)\ 

\—s + sH — t 1 — J \^=Fsin(a) cos(a) y' 

so s = a/1 — cos(q;). Since a G (0,7r/2) we deduce s < 1. But optimal ^44- 
extremals have switching time > 1. Hence none of the 744-factorizations is 
optimal, whence = M (rP, — sQ, rP) = M {—rQ, sP, —rQ) are the optimal 
factorizations. □ 



Appendix C: More details for SL(2,M) 

It is quite natural to ask for a characterization ofoptimal arcs in other groups 
(than SL(2) and PSL(2)), and in particular in SL(2,]R). First it is important 
to notice that there is^ no apriori bound for the number of factors of an 
optimal extremal in SL(2,]R). The reason is very simple: we could never 
reach all points if we considered only (ALT)-extremals and (SSP)-extremals 
with a fixed number, say, N, of factors. 

Proposition C.IO Let M = exp(Mconv(P, Q)) C SL(2,M). then 
M"" = M ^ + C^n - 1) (-|.^)^ forallneN. 

n 

In particular M.'"' ^ SL(2, M) for all n G N, and there is no apriori hound on 
the number of factors of an optimal extremal. 
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Proof. For n = 1 the claim holds true, cf. the description of one- 
parameter groups. Now assume the claim holds true for some n E N. 
Let go e £ + {2n- 1) (-f , f ) U and gi E £ + (-f , f )f/. We recall that 
£ o£ C£ + (-|, |)f/. Writing gj = Xj + ujU with Xj e £ and Uj E M our 

multiplication formula for SL(2, M) yields: 

go°gi = (uoU) og' o (uiU) 

with g' = X' + u'U E £ o £ C £ + (-|, ^)U. Hence go o gi = X" + u"U 
where u" = uq + ui + u' . Since \uo\ < {2n — 1)| and \ui\, \u\ < |, we obtain 
\u"\ < {2n + l)f , hence A^"+^ C ^ + (2r2 + 1) (-|, f ) [/ follows. This proves 
the first claim from which we immediately deduce the second claim. □ 

Nevertheless, for finite coverings of PSL(2) we get an apriori bound on 
the numbers of factor of optimal (SSP)-extremals. We will also conjecture a 
bound for (ALT)-extremals and present some evidence why this conjecture 
should be true. In any case the crucial parameter to consider is the cardinality 
of the center |2'(G')| where G denotes an arbitrary covering of PSL(2). 



Singular extremals 

Reconsidering our results for PSL(2) and SL(2) we notice that we obtained 
the bounds 5, resp., 7 if we count all virtual switches. This is actually the 
appropriate thing to do. Our goal is to show that for = we obtain 

the^ound 2N + 3. The first step is to see how Proposition [7.13| generalizes 
to SL(2,M). Except for Equation ( p!0D nothing changes. 

Proposition C.ll Let w = \/2. Then the following equalitites hold true: 

sP,-^g^ = M g, -sP, vrf/^ , (Vs>0). (16) 

MiwP,-wQ) = MiwQ,-wP,nU). (17) 
MisT,wP,-wQ) = M{wP,-wQ,-sT) s eR. (18) 
M{wP,-wQ,-wP,wQ) = exp(arsinh(2w)r). (19) 

(2 T U) \ 
Q, -(r + w)P, Q (20) 
r + w r + w J 

/w w \ 

M{wP,-wQ,-wP) = M[-Q, -2wP, --Qj . (21) 

Proof. From Eqn. we deduce M {sP, -f Q) = M (f Q, sP, nil) o 
{27rk)U for some k E Z. Since s > 0, we deduce exp(sP), exp(— 2s^^ Q) E 
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£ + (0,7r/2)t/. Therefore 

M(sP,-2s-ig) G^+(^-^,y^f/, j^(2s-iQ,-sP) G£:+(^-y,^^f/. 

Hence M (2s-^ Q, -sP, vrf/) G f + (-f , ^) f/, whence = 0, and Eqn. (|l|) 
follows for all s > 0. 

Eqn. (|18D is equivalent to e'^^'^^^h'^'^^'^^T = -T. As it involves only the 
adjoint action, it holds true regardless of the group G. To prove Eqn. ( ]I9| ) 
we observe M{wP, -wQ, -wP) = exp{-we'"^'^^^^Q) E £ + (0,7r/2)f/ and 
exp{wQ) e £ + {-it/2, 0)U, hence M {wP, -wQ, -wP, wQ) & £ + (-vr, 7r)[/. 
As an elementary computation in SL(2) yields 

/(M (wP, -wQ, -wP, wQ j) = /(exp(arsinh(2w;)T)), 

we deduce M {wP, —wQ, —wP, wQ) E £ o 27rZ U, hence ( |T9| ) follows. 

Eqn. (pOD trivially holds true for r = 0. As both sides are continuous in r, it 
must hold for all r G (— "\/2, oo). The special choice r = w yields Eqn. (p^. 

□ 

Note that Eqn. (0) is the only equation that has changed (in compari- 
son_to Proposition [7.13| . In particular, propositions |7.14| and |7.15| hold true 
in SL(2,]R) because their proofs only required Eqn. (|21|), resp., eqns. ( p!8| , pOD . 



The only instance where we used the result corresponding to Eqn. (^) was 



in the proof of Proposition T.lt 



Proposition C.12 Let \Z{G) \ = N eN. Then the factorization 

M(rP, wP, -wQ, -wQ, wP,...) 
2A^ factors 

is not optimal in G for r > 0. In particular, optimal (SSP)- extremals in G 
have at most 2N + 3 factors. 

Proof. Considering M (rP, wP, —wQ, —wQ, wP, . . .) we let = r + w > 
and observe that 

M (yuP, -wQ) = m(-Q, -f^P, -eQ] o (nU). with e = w > 0. 
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Next we observe M {wP, —wQ) — M {wQ^ —wP)o{ttU) and M {—wQ, wP) — 
m{-wP,wQ) o (nU). As \Z{G)\ = N and nU e Z{G), (nU)^ = 1. Hence 



M inP, -wQ) o M {-wQ, wP) o M {wP, -wQ) ■ ■ ■ 



N — 1 factors 




Both factorizations have equal cost r + 2Nw. But the second factorization 
cannot be optimal because it does not come from an extremal. The switch- 
ing pattern Q h — P h —Q h —P ... is neither alternating nor circular. For 
r > small the switching time for the second arc is = r + w e (w, 2w), 
so it cannot be an (SSP)-extremal either. Hence the LHS factorization 
M (/xP, —wQ, —wQ, wP, . . .) cannot be optimal. An (SSP)-extremal with 
2N + 4 factors always contains a subarc of the above form. Note that 2N + 3 
arcs are possible, just consider M {—rQ, wP, sT, wP, —wQ, . . .). □ 

Alternating extremals 

One may look for a general pattern behind the arguments we used to obtain 
bounds on the number of factors of optimal (ALT)-extremals in SL(2) and 
PSL(2). And of course, there is one. It is very natural to ask the following 
questions: 

How can we reach the central elements of SL(2,]R), i.e., knU for 
A; e N? And what is the fastest way to do so? 

An answer to the first question (how?) is relatively easy to find. Let 



Then tiace{g{s)) = 2-s^. Thus for s e (0,2) trace(5(s)) e (-2,2). Hence 
spec(5'(s)) = {cosa ± i sin a} , with cosct = 1 — — . 



Let a{s) — arccos ^1 — Y j e (0,7r). Then g{s) is actually conjugate to 
exp(a(s)[/), i.e., there exists a matrix V e SL(2) such that 




e SL(2). 
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For s — this is true, and for s e (0, 2) it follows by continuity. In SL(2, M) 
we therefore have 

lis) := M (sP, -sQ) = M (X, a{s)U, -X) for some X G sl(2). 

Next we look for solutions of 7(5)™ = knU, m,k E N. As kirU is central, 
this is equivalent to ma{s) = kn. Since a{s) e (0, tt), solutions exist only 
for m > k: 

k 

2 

Since cos o; = 1 — y, we find 

— 2(1 — cos a) = 2 • 2sin^ , hence s{a) = 2 sin . 



s 



Thus for Qifej the appropriate switching time is Skj — 2sm{akj/2). The cost 
of this factorization is 



2{k + j) Skj = 2{k + j) sin = ki: 



kn \ , ^^^\2(k+j) 



Since x 1— >• is decreasing and nonnegative in [0, tt] and 2(k+3) ^ f ] 
for k,j e N, we deduce that j — 1 provides the best factorization of the 
form 7(5)'^+^. Let us write 

A{n] r, s, t) = M(rP, -sQ, ,...) 



n factors 

exp(rP) M (— sQ, sP) 2 exp{—tQ), n even, 

n — 1 1 

^exp(rP)M(-s(3,sP) 2 M{sP,-tQ), n odd, 
and let = A{n; s, s, s). Then we can prove: 

Proposition C.13 For n e N let Sn = 2 cos (^) . Then 

= (n-2)|c/ for all n> 3. 
Moreover, A{n+ 1; t, s„, s„) is noi optimal for t > 0. 
Proof. First we observe that 1 — ^ = — cos(|7r) = cos (^^tt), hence 

7(sn) = M (^X, ^^^TT U, -X^ for some X e 5l{2). 
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Now we distinguish two cases. First, ii n — 2k is even then we obtain 
lisn)' = M (^X, !^7r U, -xJ^M (x, ^^^^ ~ tt U, -X^ 

77—2 

= (A;-l)7rC/ = ^— ttC/. 

On the other, if n = 2A; + 1 is odd, we let 7 = An{sn). We observe that 
zani^) = J, so J = tT + uU e MT + RU. Moreover, 

/ 2k — 1 \ 

7M7) = 7(^2^+1)''=+^ = M(^X,(2A; + l)^^7rC/,-xj 

= {2k - l)7iU ={n- 2)tiU. 

As au{rT + uU) — —tT + uU, we compute 

f{TT + uU)f{-TT + uU) = (cosh^(r)cos(2M) + sinh2(r)) id 

+ cosh^(r) sin(2M) U — 2 sinh(r) sin(M) H. 

Since f{{2k — l)7iU) = — id, we deduce cos(2m) = — 1, hence 2u E n + 27rZ, 
whence w G | + ttZ. Thus sin(M) 7^ 0, so r = and 7 = uU for some 
M e I + ttZ. Thus <Ju{j) = 7, and therefore ^(Tu{j) — {n — 2)ttU imphes 



In order to prove that A{n + 1; s„, s„) is not optimal, we observe that 
An{sn) e RU and Fix{au) = RU. Therefore = cru{An{sn)) = 

M {—SnQ, SnP, ■ ■ ■), and we obtain 

A{n + 1; t, Sn, Sn) = exp(tP) M {-SnQ, SnP, ■■ ■) = exp{tP)au{An{sn)) 

= eXl){tP)An{Sn) =M{{t + Sn)P, SnQ, SnP, SnQ, ■ ■ ■) , 

and both factorizations have equal cost t + ns„. But since t > 0, the RHS 
does not come from an extremal, hence it cannot be optimal. □ 



Remark C.14 It is clear that s„ is algebraic as it is twice the real part of a 
2n-th root of 1 G C. For small n we obtain: 



n 


3 


4 


5 


6 


7 


8 




1 


V2 


1+^5 




2cos(7r/7) 


^2 + ^J2 






1.41421 


1.61803 


1.73205 


1.80194 


1.87939 



63 



Algebraically, for n = 2k even let 
and for n = 2k + 1 odd let 

j=0 j=l ^ 

Then s„) = for all n G N. In fact for n = 2k even, p„ is derived 
from Q{z^ — z) = while for n = 2k + 1 odd, p„ is derived from J^^j^ = 

Sj=o(~'^)"' = 0, for 2; = ^ + Za/I — For small n we obtain 

Ps{s/2) = (s-iy, p,{s/2) = i(.2-2), 

p,{s/2) = (s'-s-iy, p,{s/2) = i.(s2-3), 

pj{s/2) = (s^-s'-2s + iy, psis/2) = 1(^4 -4^2 + 2), 

p,{s/2) = (s-l)2(,3_3,_i)2^ p^^(,/2) = i,(,4_5,2 + 5). 

Now we can formulate our conjecture concerning optimal (ALT)-extremals 



Conjecture C.15 Let n E N, n > 3. Then y4„(s„) is optimal in SL(2,M) 
and in any group with > n — 2. 

If A{n] r, s,t) is optimal, then s > This is true for any group with Lie 

algebra 3 [(2). 

If |2'(G')| = A^, then optimal (ALT)-extremals have at most A^ + 3 factors. 
If A{N + 3;r, s,t) is optimal, then s G [sAr+2, S2iv+2]- 

After our discussion on how to reach the central elements it should be clear 
why we expect the first statement to hold true. The second part of the con- 
jecture is trivial for n = 3 (s > S2 = 0), and it has already been proved for 
n = 4 and n = 5, cf. Propositions [7.9| , [7.11| . Finally, the last statement gene- 
realizes the arguments given in Propositions [7.12| , and p.6| . We conjecture 



that in general there exists a function /iAr(s) with the following properties: 
• For g := A{N + 1; ^n{s), s, s) we have 

g o g = NttU if A^ is odd, 
g o au{—g) = NttU if A^ is even. 
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N 




/iAr(s) = 2s 


yUAr(s) = S 


1 
2 


2 

s 

2s 

S2 - 1 


1 = S3 
72 = S4 


72 = S4 

73 = S6 


3 
4 
5 


2(^2-1) 
s(s2 - 2) 

2s(s2 -2) 
- 3s2 + 1 
2{s^ - 3s2 + 1) 
s(s4 -4s2 + 3) 


73 = S6 
S7 


72 + 72 = sg 

, / 5+v^ _ „ 
V 2 - 



Table 3: The function /iAr(s) for = 1, . . . , 5. 

• /iAr(s) is well-defined for s > s^v+i, s < /iAr(s) < 2s for s G {sn+2, S2N+2), 
and /iAr(s) = s for s = S2iv+2, fJ^N^s) = 2s for s = SAr+2. 

For small one can verify this explicitly, cf. Table |^ Provided we have such 
a function fi^ we obtain that A{N + 2; r, s, t) is not optimal if s > siy+2 and 
r + t > /iAr(s). Hence 

s > S2N+2, r = s, t = =^ yl7v+i(s) not optimal. 

s = S2N+2, r = s, t > =^ A{N + 2; s, s, t) not optimal. 

s e {sn+2, S2N+2), r = t = s =^ AAr+2(s) uot Optimal. 

Recalling our conjecture that optimal A{N+A; ■)-extremals must have switch- 
ing time s > S7V+3, it is clear why we are convinced that A{N + 4;r, s,t) 
cannot be optimal if r > or t > 0. 
We conclude with one more 

Conjecture C.16 The factorization A„(s) is optimal in SL(2,]R) for all s G 
[2,272], n>3. 
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